





PROCEEDINGS 

OF THE 

ROYAL SOCIETY OF LONDON 


SERIES A. 


MATHEMATICAL AND PHYSICAL SCIENCES 


VOL 183 


LONDON 

Printed and published for the Royal Society 
By the Cambridge University Press 
Bentley House, N.W.1


18 June 1945 



PRINTED IN GREAT BRITAIN BY 
WALTER LEWIS, M.A. 

AT THE CAMBRIDGE UNIVERSITY PRESS 



CONTENTS

SERIES A VOLUME 183

No. A 992. 10 August 1944

PAGE

An investigation of the scattering of high-energy particles from the cyclotron by the photographic method. I. The experimental method. By J. Chadwick, F.R.S., A. N. May, T. G. Pickavance and C. F. Powell . . . 1

Long duration of the Balmer spectrum in excited hydrogen. By Lord Rayleigh, F.R.S. (Plate 1) . . . 26

The thermal decomposition of acetone. By J. R. E. Smith and C. N. Hinshelwood, F.R.S. . . . 33

On the flow of gas through a nozzle with velocities exceeding the speed of sound. By L. Fox and R. V. Southwell, F.R.S. . . . 38

The production of cosmic-ray bursts by mesotrons. By C. B. O. Mohr and G. H. Stafford . . . 54

Applications of the photographic method to problems in nuclear physics. II. The scattering of 8.8 and 13 MeV neutrons by protons. By F. C. Champion and C. F. Powell . . . 64

On the equation of state for solids. By R. Fürth . . . 87

Continuous spectra in flames: the role of atomic oxygen in combustion. By A. G. Gaydon. (Plate 2) . . . 111

No. A 993. 30 November 1944

Relaxation methods applied to engineering problems. X. The graphical representation of stress. By D. N. de G. Allen and R. V. Southwell, F.R.S. . . . 125

On the theory of point-particles. By H. J. Bhabha, F.R.S. and Harish-Chandra . . . 134

On the removal of the infinite self-energies of point-particles. By Harish-Chandra . . . 142

The combination of fibrous proteins with acids. II. The adsorption of dye anions. By G. A. Gilbert . . . 167

The transition effect of penetrating showers. By L. Janossy and G. D. Rochester . . . 181

The barometer effect of penetrating showers. By L. Janossy and G. D. Rochester . . . 186

Some aspects of the production of mesons and the barometer effect of penetrating showers. By L. Janossy . . . 190

Statistical thermodynamics of mixtures with zero energies of mixing. By E. A. Guggenheim . . . 203

Statistical thermodynamics of mixtures with non-zero energies of mixing. By E. A. Guggenheim . . . 213

The initiation of long electrical discharges. By C. E. R. Bruce . . . 228












No. A 994. 22 February 1945

PAGE

Anniversary Address by Sir Henry Dale, O.M., G.B.E. . . . 243

Relaxation methods applied to engineering problems. XI. Problems governed by the 'quasi-plane-potential equation'. By D. N. de G. Allen, R. V. Southwell, F.R.S. and Gillian Vaisey . . . 258

Unitary representations of the Lorentz group. By P. A. M. Dirac, F.R.S. . . . 284

An investigation of the performance of the Rayleigh disk. By R. A. Scott . . . 290

A note on the Hamiltonian theory of quantization. By T. S. Chang . . . 310

The occurrence and properties of molecular vibrations with $V(x) = ax^4$. By R. P. Bell . . . 328

Absorption of radium (B+C) gamma-rays. By J. E. Roberts . . . 338

No. A 995. 18 June 1945

The normal vibrations of bridged $X_2Y_6$ molecules. By R. P. Bell and H. C. Longuet-Higgins . . . 357

The dissociation energies of CO, N$_2$, NO and CN. By A. G. Gaydon and W. G. Penney . . . 374

The structure of polyisoprenes. I. The crystal structure of geranylamine hydrochloride. By G. A. Jeffrey . . . 388

The statistical theory of the strength of bundles of threads. I. By H. E. Daniels . . . 405

Stationary electron swarms in electromagnetic fields. By D. Gabor . . . 436

The electrical conductivity of an ionized gas in a magnetic field, with applications to the solar atmosphere and the ionosphere. By T. G. Cowling . . . 453

Index . . . 481








An investigation of the scattering of high-energy particles 
from the cyclotron by the photographic method 

I. The experimental method 

By J. Chadwick, F.R.S., A. N. May, T. G. Pickavance and C. F. Powell 

(Received 14 September 1943) 

The advantages to be derived from a successful application of the photographic method of 
detecting nuclear particles to the study of the scattering of the high-energy beams of particles
provided by the cyclotron are discussed. The design of a ‘camera’ for this purpose is
described. The most important feature of the method lies in the fact that a single plate, 
exposed to the scattered particles for a few minutes, contains the information required for a 
complete examination of the variation of the scattered intensity in the angular range from 
10 to 170° with the direction of the primary beam. A very great saving is thus effected in the 
time the cyclotron is occupied in making such experiments as compared with other methods 
of detection. 

The theory of the method is discussed in some detail, and tests of the performance of the 
instrument are described which demonstrate its reliability. It is shown that the angular 
resolving power of the method is of the order of ± 2°, and that its ability to distinguish different 
groups of scattered and disintegration particles from range distribution curves is superior to 
that hitherto achieved with counter and other methods of detection. 


A. DESCRIPTION OF THE METHOD 
1. Introduction

The investigation of the scattering of charged particles by atomic nuclei is still one
of the most powerful methods of determining the nature of the nuclear forces, and
many experiments have been made during the past few years using artificially
accelerated protons and deuterons in addition to the α-particles from the natural
radioactive elements. As a result it has become clear that it is important to extend
these investigations to the high-energy particles which can be produced only with
the cyclotron. Such experiments are attended, however, with considerable difficulties
if counter or expansion-chamber methods of detecting the scattered particles
are employed.

The technical difficulties arise from the following causes. In order to make
experiments in conditions of good geometry, which, as will be shown later, are highly
desirable if the finer details of the scattering processes are to be elucidated, it is
necessary to define a narrow beam of particles from the cyclotron with a restricted
angle of divergence. The currents then available are at present rather small, of the
order of $10^{-7}$ to $10^{-8}$ A. Secondly, it is desirable to use thin targets in order to be
able to study the different scattering processes in conditions where the primary beam 
is homogeneous in energy. Otherwise, with thick targets, it is difficult to determine 
the variation of the efficiency of a particular process with the energy of the primary 
beam and, further, the particles produced in a nuclear reaction with a definite energy 
release do not appear as a homogeneous group but have a spread in energy corresponding
to the variation in the energy of the primary beam in traversing the target.
In such conditions it is not possible to resolve different groups and to study them
individually. With this second restriction the yield of scattered and disintegration 
particles becomes inconveniently small. 

The low yield of scattered particles is particularly serious in work with primary 
beams of fast deuterons when counter methods of detection are employed, because 
they have to be recorded against a high background of impulses produced by the 
heavy general neutron flux from the cyclotron tank (Pollard & Watson 1940). As
a consequence of this background the resolving power of the method for groups of
different range is restricted even when using proportional counters in coincidence
for the detection of the scattered particles. It has been pointed out by Bower &
Burcham (1939) that the detection of the scattered particles by the expansion
chamber allows high resolving power without serious restriction of solid angle, and
Schultz, Davidson & Ott (1940) have used the method to detect the scattered
particles resulting from the bombardment of neon by a beam of deuterons with an
energy of 3.8 MeV. Whilst an improved resolving power was thus obtained, the
method is too laborious to allow a detailed examination of the variation of the 
scattered intensity with angle to be attempted. 

It seemed clear to us that the photographic method of detection offered very great 
advantages in dealing with this problem. It has been shown (Powell, May, Chadwick
& Pickavance 1939, 1940, 1943) that the method is reliable for the determination
of the energy and the intensity of homogeneous groups of particles, and for the
present purpose it has the great advantage that the particles scattered through
different angles can all be recorded at the same time, thus eliminating the necessity
of monitoring the primary beam. Since the scattered particles of all energies and
at all angles of scattering are recorded simultaneously, the time for which the cyclotron
is occupied in making a single exposure is enormously reduced compared with
that required for the same investigation using the other methods of detection.
Further, the single exposure gives a permanent record of many hundreds of times as
many tracks as are required for a satisfactory investigation of the variation of the 
scattered intensity over the complete angular range from 10 to 170° with the 
direction of the primary beam. Most of this wealth of experimental material is 
normally not employed, but any part of it can be used if particular regions of the 
distribution of the scattered particles show interesting features which it is desirable 
to examine in detail. 

In a previous note (Powell et al. 1940) we have given a preliminary account of the 
results of experiments on the scattering of protons which show that these advantages 
can be realized, our conclusions in this respect being substantially the same as those 
of Wilkins & Kuerti (1939, 1940) and Wilkins (1941). In this and subsequent papers we
give a detailed account of the methods employed and the results obtained for the
scattering, by twelve different elements, of 4 MeV protons and 7 MeV deuterons,
produced by the Liverpool cyclotron. The great economy which the method effects
in the operation time of the cyclotron is illustrated by the fact that the exposures
from which the present results were obtained were all made in two periods each of
3 days. It will be shown that even in work with deuteron beams the proton recoils
produced by the neutron background give rise to no serious difficulties, and that the 
experimental width of homogeneous groups of particles appears to be considerably 
less than that achieved hitherto in similar experiments using counter methods of 
detection. We would emphasize that these papers are designed to display the power 
of the method and to make a rough survey of the field which is thus opened for 
investigation. They do not constitute a finished investigation, but the results enable 
us to show the most suitable reactions for a much more detailed study in which 
experiments will be made over a wide range of energies of the primary particles, and 
we shall make these experiments when a suitable opportunity occurs. 


2. Description of the apparatus

Wilkins (1941) has described a camera for use with the cyclotron in which sixty-six
photographic plates are adjusted radially about a scattering foil, each plate recording
particles scattered at a single angle with respect to the primary beam. Sixty-six
separate shutters were employed, each of which could be opened and shut during
exposure to give a suitable density of tracks on the corresponding plate. We have
preferred to use a camera with a single large rectangular plate, and to study the
scattering, when possible, from gas ‘targets’. The essential features of the method 
were described in our previous note. 

In our design a defined beam from the cyclotron passes down the axis of a tube
parallel to, and just above, the plane of the rectangular photographic plate. This tube
has an interruption through which some of the particles scattered by the gas filling the
camera can escape and enter the photographic emulsion at a glancing angle to the 
surface. The essential features of this arrangement are shown in figure 1. It has the 
advantage that a single plate contains information on the scattering from 10 to 
170°, providing regions at any angle of scattering in this range, in which the density 
of tracks is convenient for counting purposes. 

The stray field of the cyclotron magnet causes a considerable curvature in the 
path of a beam of ions emerging from the target chamber. The scattering apparatus 
is therefore placed about 2 ft. from the pole-piece of the magnet, where the stray 
field is relatively weak, and the beam is brought to it down an evacuated tube, 
shown in figure 2, which is bolted on to the face of the target chamber. This tube is
bent to follow the estimated path of the beam, and has a section of tombac tubing,
$T_1$, so that it can be moved while the cyclotron is running, until the spot formed by
the impact of the beam emerging from the mica window $s_2$ on a fluorescent screen
has the maximum brightness and is uniformly illuminated. The entrance stop, $s_1$, is
12 mm. in diameter and limits the angular divergence of the beam leaving $s_2$, which
is 3 mm. in diameter, to 1°. The intermediate stops are designed to prevent particles
scattered at the walls of the tube from reaching $s_2$. The tube is evacuated, through
$s_1$ and the target chamber, by the main pumps.



The beam, 3 mm. in diameter and with a divergence of 1°, emerges from the stop
$s_2$ and enters the tube B (figure 2), which has an internal diameter of 15 mm. This
tube can be adjusted by the tombac joint $T_2$ until the beam emerges centrally from
the far end, as shown by the illuminated patch which it produces on a fluorescent
screen. When these adjustments have been made, the screws controlling the movement
of the tombac joints are locked in position so that the position of tube B is fixed.



Figure 1. Perspective drawing to show disposition of main components of the apparatus. 



Figure 2. Bent tube and slit system for the definition of the 
beam in the stray field of the magnet. 









The scattering apparatus is shown in figure 3. Tube B is a sliding fit on a guide 
tube carried by the front wall of the camera box. The ‘camera’ is mounted on 
levelling screws on a carriage moving on rails, not shown in figure 3. The rails are 
first adjusted to be parallel to the axis of tube B. The carriage is then placed on the 
rails and the levelling screws defining the disposition of the camera are adjusted until 
the axis of the guide tube on the camera coincides with the axis of the tube B. The 
camera can then be moved so that the tube B enters the guide tube and moves 
forward until it reaches a position defined by a stop. When this operation has been 
completed there is a gap of 3 mm., at the point g, figure 3, between the end of the 
tube B and the opposing edge of the co-axial tube C. The edges of the two tubes are 
chamfered externally at the gap g to a feather edge at an angle of 5°. 

Particles scattered from the beam by the gas filling the camera can emerge through
the gap g to enter the emulsion of the plate, 11 × 4 in., which is carried by a plate
holder sliding in grooves on the floor of the camera box. The surface of the emulsion
is 14.5 mm. below the level of the axis of the tube B and parallel to it. The first stop
carried by the tube B, which is 2 mm. in diameter, gives the final definition of the
beam. Subsequent stops are clear of the beam and prevent particles scattered by
the mica window or by the edges of the stop from reaching the gap g and passing
through it. A thin aluminium foil, with an equivalent thickness of 1 cm. of air,
covers the gap and prevents light generated by the passage of the beam through the
gas from fogging the plate. Copper foils, with an equivalent thickness of 60 and
120 cm. air, can also be rotated into the path of the scattered beam for use in experiments
in which it is desired to examine only the high-energy protons arising from
disintegration reactions, when a deuteron beam is being used to bombard certain of
the light elements. The disposition of these foils is shown in section in figure 3d.

An ion collector inside the target chamber and just in front of stop $s_1$ (figure 2) is
used to measure the total beam current and to indicate resonance between the
magnetic and high-frequency fields of the cyclotron. Only a small fraction of the
total beam current, which was generally of the order of $5 \times 10^{-6}$ A, reaches the tube
B, where the beam current is of the order of $10^{-8}$ A.

It is important that the position of any point on the photographic plate with respect
to the mid-point of the gap g shall be known to an accuracy of 0.1 mm. The position
of the plate carrier in the camera is defined by a stop, not shown in figure 3, which
comes against the front wall of the camera box. The position of the plate in the
carrier is determined, after exposure to the beam, by inserting in place of the usual
dark cover slide an alternative slide which has been drilled with a rectangular network
of holes 0.5 mm. in diameter and 1 cm. apart. This perforated slide is brought
to a standard position in the carrier by means of a mark which is brought to coincide
with a corresponding mark on the carrier. On exposure to a distant source of light
for 1 sec. before development a set of fiducial spots is thus imposed on the plate, from
which the position of any point on the plate relative to the gap can be determined.
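The fiducial grid amounts to a simple coordinate bookkeeping scheme. The following is a minimal sketch in Python, assuming a perfectly regular grid with the 1 cm. spacing quoted in the text. The names `nearest_spot` and `plate_position` are illustrative only, and `origin` (the plate coordinates, relative to the mid-point of the gap, of the spot labelled (0, 0)) is a hypothetical calibration value that the paper does not give.

```python
PITCH_MM = 10.0  # spacing of the fiducial holes quoted in the text (1 cm)

def nearest_spot(x_mm, y_mm, origin=(0.0, 0.0), pitch=PITCH_MM):
    """Return the grid indices (i, j) of the fiducial spot nearest to a plate point."""
    i = round((x_mm - origin[0]) / pitch)
    j = round((y_mm - origin[1]) / pitch)
    return i, j

def plate_position(i, j, dx_mm=0.0, dy_mm=0.0, origin=(0.0, 0.0), pitch=PITCH_MM):
    """Plate coordinates of a point displaced (dx, dy) from fiducial spot (i, j).

    dx_mm and dy_mm stand for the stage displacement read from the microscope;
    `origin` is a hypothetical calibration value, not taken from the paper.
    """
    return origin[0] + i * pitch + dx_mm, origin[1] + j * pitch + dy_mm
```

In this picture a point is located by finding the nearest spot and adding the measured stage displacement, so that only displacements of a few millimetres need to be read from the microscope.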

The plate carrier is provided with four strong leaf springs which hold the plate 
down at the four corners and prevent relative movement of the plate and its carrier. 




This also avoids any bending of the photographic plate. For convenience in the 
microscopic examination the plates are made of glass only 1 mm. thick, and the 
tension in the gelatine may bend such a large thin plate if it is unconstrained. This is 
most likely to happen during the evacuation of the camera before the admission of the 
scattering gas, when the emulsion loses water by evaporation and tends to contract. 

Before making an exposure, the loaded plate carrier is inserted into the camera, 
and the dark slide withdrawn through the rear end which is then covered with a 
brass plate, the joint being made gas-tight with a rubber gasket. The camera box 
is then exhausted and filled with the gas under investigation, through a side tube. 
Care is necessary to prevent the emulsion from drying out too rigorously, since this 
is apt to make the thick emulsions peel off the glass. At the same time it is important 
to have no water vapour in the scattering volume. It is found that a phosphorus 
pentoxide boat at the far end of tube C is adequate to keep the gas in the scattering 
volume dry, the aluminium foil preventing rapid diffusion of water vapour from the 
emulsion into the scattering volume. The camera is charged by successively filling 
with the gas and exhausting to the vapour pressure of water until the concentration 
of air has reached a sufficiently small value. 

For experiments with target elements in the form of thin foils, the tubes B and C
are withdrawn to increase the width of the gap between them to 38 mm. A foil
holder, shown in figure 3b, is then inserted into the open end of tube B. The holder
carries the scattering foil stretched across its end, perpendicular to the axis of the
beam. The holder is cut away to allow the scattered particles to be observed in the
backward as well as the forward directions. With this arrangement exposures with
thin targets of aluminium and boron have been made, and from them we can determine
the scattering in the angular range from 30 to 80° and from 100 to 145°. This
restriction in the angular range is a serious disadvantage, and in future work the
design of the foil holder will be modified to allow the scattering over the whole
angular range to be examined.

3. Geometry of the apparatus 
(a) Determination of intensity of scattered beam 

The essential features of the geometry of the apparatus are shown in figure 4, in
which $h$ represents the height of the axis of the beam above the plane of the surface
of the emulsion. Suppose that around a point in the plate corresponding to an angle
of scattering $\theta$, a small area, $A$, contains $N_{K,\theta}$ tracks of a particular type $K$. Let the
distance along the path of the particles from the centre of the scattering volume to
the centre of the area $A$ be $r$. Then the solid angle subtended by $A$ at the centre of
the scattering volume is $Ah/r^3$. The scattering volume is given by $la$, where $a$ is
the area of cross-section of the beam, and $l$ is the length of the beam which can be seen
from $A$ through the gap g. If $s$ denotes the length of the gap, $l = sp'/p$, where $p'$ and $p$
are the distances of $A$, in the plane defined by $A$ and the axis of the beam, from the
axis of the beam and from the inner wall of the tube B respectively. The factor $p'/p$
clearly depends only on the $Y$ co-ordinate of $A$ (see figure 4). It is nearly equal to
unity and varies only slowly with $Y$. To a first approximation the nuclear cross-section
per unit solid angle at the angle $\theta$, $\sigma_{K,\theta}$, for the process leading to particles
of type $K$, may be written

$$\sigma_{K,\theta} = \frac{r^3 N_{K,\theta}}{N_0\, n\, s\, a\, h\, A}, \tag{1}$$

where $N_0$ is the number of primary particles which have passed down the tube B
in the beam during the exposure, and $n$ is the number of nuclei per c.c. of the gas in
the camera. Thus the quantity $r^3 N_{K,\theta}/A$, when corrected by the factor $p'/p$ for the
variation of the scattering volume with the value of $Y$ at the point at which the
count of $N_{K,\theta}$ is determined, will give the variation of the intensity of the scattered
beam with angle. In this analysis the variation of the angle of scattering of the
particles reaching a given point of the plate from different points in the scattering
volume, and other corrections arising from the width of the gap g, and from the
finite cross-section and divergence of the beam have been neglected. We have also
assumed that the two tubes B and C are in perfect alinement with one another, and
that the edges of the tubes at the gap g are infinitely thin. The experimental tests
described below show that these assumptions give a sufficiently close approximation
to the actual conditions. A more refined analysis is given in Appendix 1.

Figure 4. Geometry of the scattering system.
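The bookkeeping in this first-order analysis can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the authors' procedure. The function names are invented here. The lateral co-ordinate `y` is assumed to be measured from the line on the plate directly beneath the beam axis, so that the distance of the counted area from the axis is the hypotenuse of `y` and `h`, and the distance from the inner wall is smaller by the tube radius. The values 14.5 mm. (height of the axis) and 7.5 mm. (half the 15 mm. internal diameter of tube B) are taken from the description of the apparatus.

```python
import math

def solid_angle(area, h, r):
    """Solid angle A*h/r**3 subtended by a small plate area at the scattering centre.

    All lengths are in the same unit; h/r is the sine of the glancing angle
    at which the particles meet the emulsion.
    """
    return area * h / r**3

def gap_factor(y, h, tube_radius):
    """Factor p'/p by which the visible beam length exceeds the gap length s.

    Assumes y is the lateral offset of the counted area from the line
    directly below the beam axis (an assumption about the Y origin).
    """
    p_axis = math.hypot(y, h)          # p': distance of the area from the beam axis
    p_wall = p_axis - tube_radius      # p : distance from the inner wall of the tube
    if p_wall <= 0:
        raise ValueError("plate point must lie outside the tube")
    return p_axis / p_wall

def relative_intensity(n_tracks, area, r, y, h, tube_radius):
    """Quantity r**3 * N / A, divided by p'/p, proportional to the scattered intensity."""
    return r**3 * n_tracks / area / gap_factor(y, h, tube_radius)
```

Only ratios of `relative_intensity` between different angles are meaningful in this sketch; the absolute cross-section additionally requires the constants appearing in equation (1).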

The analysis shows that the density of tracks at points in the plate corresponding
to a given angle of scattering, $\theta$, must be expected to vary as $1/r^3$. This leads to a
very rapid variation of density across the plate. Thus at small values of $r$ the plates
are visibly blackened. With the exposures normally employed this blackening
extends to a distance of about 2 cm. from the gap at small angles of scattering, and
the form of this smudge gives a striking picture of the variation of the intensity of
scattering with angle. Counting can be carried out at values of $r$ ranging from 4 to
20 cm., corresponding to a ratio of 125 to 1 in density. This variation gives a considerable
latitude in the value of the exposure required to produce satisfactory plates
and allows the scattering at large and small angles to be measured on the same plate
in spite of the rapid variation in the intensity with angle. The rapid variation of the
density with $r$ makes it desirable to determine $r$ with an accuracy of the order of
0.1 mm. Thus with $r$ equal to 5 cm., an error of 0.1 mm. in the determination of $r$
gives rise to an error of 1.2 % in the value of the intensity. This is one-half of the
statistical error arising from the number of tracks which it is usual to count at any
particular value of the angle of scattering. It is therefore necessary to set the plate
in the microscope stage, and to determine the position of the part of the plate being
examined with this accuracy.
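The quoted ratio of 125 to 1 follows directly from the inverse-cube law. A minimal Python check, with a function name chosen here purely for illustration:

```python
def density_ratio(r_near, r_far):
    """Ratio of track density at r_near to that at r_far for a 1/r**3 law."""
    return (r_far / r_near) ** 3

print(density_ratio(4.0, 20.0))  # 125.0, the ratio quoted for r between 4 and 20 cm
```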


(b) Determination of the angle of scattering

Primary particles passing down the axis of the tube B and scattered through an
angle $\theta$ so that they pass through the centre of gap g, trace out a cone which has its
apex on the axis at a distance $R \cot\theta$ in front of the mid-plane of the gap g, where
$R$ is the internal radius of the tubes B and C. This cone cuts the plane of the emulsion
in a hyperbola. The hyperbolae for values of $\theta$ at intervals of 2½° have been drawn
on a large-scale map of the plate together with lines of constant distance from the
centre of the scattering volume corresponding to any angle of scattering, a point
which is taken to be the apex of the cone as described above. A simplified map of
this type is shown in figure 5, and it will be seen that the lines of constant distance
are not circles, since the position of the scattering volume varies with the angle of
scattering.

Figure 5. Map of plate showing lines of constant angle of scattering and of constant distance
from the mean point of scattering. To avoid confusion the rectangular grid of dots is omitted.
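The construction of such a map for the gas target can be sketched in Python. The sketch rests on the idealization described in the text (scattering on the axis, rays through the centre of the gap) and on a co-ordinate convention assumed here, not stated in the paper: `x` is measured along the beam direction from the mid-plane of the gap, positive downstream, and `y` laterally from the line directly below the axis. With the tube radius written as `radius` and the height of the axis as `h`, a plate point at distance $\rho = \sqrt{y^2 + h^2}$ from the axis satisfies $x = (\rho - R)\cot\theta$. The function names are illustrative.

```python
import math

H_MM = 14.5   # height of the beam axis above the emulsion (from the text)
R_MM = 7.5    # internal radius of tube B (15 mm internal diameter)

def scattering_angle_and_distance(x, y, h=H_MM, radius=R_MM):
    """Return (theta in degrees, r in mm) for a plate point (x, y).

    r is the path length from the apex of the cone, taken as the centre of
    the scattering volume, to the plate point.
    """
    rho = math.hypot(y, h)                # distance of the plate point from the axis
    theta = math.atan2(rho - radius, x)   # from x = (rho - R) * cot(theta)
    r = rho / math.sin(theta)
    return math.degrees(theta), r

def hyperbola_x(y, theta_deg, h=H_MM, radius=R_MM):
    """x co-ordinate of the constant-angle hyperbola at lateral offset y."""
    rho = math.hypot(y, h)
    return (rho - radius) / math.tan(math.radians(theta_deg))
```

Evaluating `hyperbola_x` over a range of `y` for each angle traces the lines of constant angle, and contouring the second value returned by `scattering_angle_and_distance` gives the lines of constant distance, which are not circles because the apex moves with the angle.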

In the exposures with foils the mean point of scattering is independent of the angle
of scattering, so that lines of constant distance from this point are true circles on the
photographic plate, and the lines of constant angle are hyperbolae corresponding
to the intersection with the plane of the emulsion of cones with a common apex in
the foil. A separate map is constructed for the plates exposed with the scattering
foils appropriate to these geometrical conditions.


4. Examination of the plates 
(a) Microscope equipment 

The method of processing the plates and measuring the tracks has been discussed
in previous papers. Ilford half-tone plates were employed with an emulsion thickness
of 50 or 100 μ according to the range of the scattered particles. After development
the plates were cut into twenty-four equal pieces, each 1⅜ by 1⅓ in., and covered
with a no. 0 cover-slip of thickness 0.1 mm., mounted on Canada balsam. They were
examined with a binocular microscope with a 3 mm. apochromat immersion objective
in conjunction with a × 6 eyepiece. Occasional use was made of × 12 eyepieces
for special purposes.

Figure 6. Plate holder for the microscopic examination of the plates.

In examining the particles corresponding to any value of the angle of scattering,
the appropriate section of the plate is fitted in a small turntable, shown in section
and plan in figure 6, which can be clamped to the mechanical stage of the microscope.
The turntable is rotated so that part of the hyperbola traced on the plate by particles
scattered through the angle $\theta$ will be parallel to the X motion of the mechanical
stage. The fiducial spot on the plate which lies nearest to the hyperbola for the angle
of scattering $\theta$ is brought into the field of view of the microscope, and the perpendicular
distance from this spot to the hyperbola is read off from the map (see figure 5).
The Y motion of the mechanical stage is then shifted through this distance so that
the tracks of particles scattered through the angle $\theta$ lie in the field of view. If the
X wheel of the stage is now turned the angle of scattering of the tracks under
observation is not altered.

(b) Range distribution of the scattered particles 

To determine the distribution in range of the scattered particles, a part of the 
plate is chosen where the density of tracks is convenient, of the order of three or four 
per field of view, and the lengths of the tracks are measured against the eyepiece 
scale. The plate is scanned in the X direction, and the lengths of all tracks which 
originate in a defined central region of the field of view are measured. The speed of 
measurement varies greatly with the complexity of the range distribution of the 
scattered particles, but in simple cases it is of the order of one hundred an hour. The 
measurements are easier than in the case of the recoil protons in the investigation of 
neutron beams, since the tracks are all parallel and they start in the surface of the 
emulsion. 

To convert the range of the scattered particles into cm. of standard air, the stopping
power of the emulsion as determined in previous experiments is used; 1 μ in
the emulsion is taken to be equivalent to 1.46 mm. air. It is possible that small
uncontrolled differences in the processes of manufacture may produce small changes 
in the stopping power for different batches of plates of the same type. We have no 
evidence for the existence of any significant differences of this kind and they are 
certainly small. Very accurate measurements of the energy of the particles have not 
appeared to be necessary in the experiments so far undertaken, and we have not 
thought it necessary to determine the stopping power of every batch of plates. The 
observed range must be corrected for the air equivalent thickness of the aluminium 
foil covering the gap and of the gas between the point of scattering and the point 
of observation on the plate. 
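The conversion and the two corrections can be written as a short Python sketch. The factor of 1.46 mm. of air per micron and the 1 cm. air equivalent of the aluminium foil are the values quoted in the text. The parameter `gas_cm`, the air equivalent of the gas traversed between the point of scattering and the plate, depends on the gas, its pressure and the path length, and must be supplied by the user; it is a placeholder introduced here. The sketch also ignores any dependence of the foil correction on the angle at which the particles cross it.

```python
MM_AIR_PER_MICRON = 1.46  # stopping power of the emulsion quoted in the text

def range_in_air_cm(track_length_um, foil_cm=1.0, gas_cm=0.0):
    """Range in cm of standard air at the point of scattering.

    track_length_um: track length measured in the emulsion, in microns.
    foil_cm: air equivalent of the aluminium foil over the gap (1 cm in the text).
    gas_cm: air equivalent of the gas path, a user-supplied assumption.
    """
    emulsion_cm = track_length_um * MM_AIR_PER_MICRON / 10.0  # mm of air to cm of air
    return emulsion_cm + foil_cm + gas_cm
```

For example, a track 100 μ long with no gas correction corresponds to 14.6 cm. of air in the emulsion plus 1 cm. for the foil.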


(c) Intensity measurements 

To measure the intensity of the scattered particles at a particular angle of scattering,
in cases where the scattered particles are homogeneous in energy, the density
of the tracks is determined by setting the plate as above, and turning the eyepiece
scale until it is perpendicular to the tracks. The mechanical stage is then moved
slowly in the X direction through, say, 4 mm., whilst the tracks crossing the scale
are counted. The X motion is then made to return to its original setting and the Y
motion is moved through a distance corresponding to the width of the eyepiece scale
(0.1 mm.), and the plate is again scanned in the X direction through 4 mm. These
operations are repeated until the number of tracks counted at this particular angle
is sufficiently great to reduce the statistical error to the desired value. With the
exposures normally used the variation in the angle of scattering over an area containing
1000 tracks is negligible. The value of $r$, the distance from the centre of the
scattering volume to the centre of the scanned area, is read off from the map with an
accuracy of 0.1 mm. Local small irregularities in the surface of the gelatine may cause
errors in the determination of the intensity by changing the angle of incidence of
the tracks on the surface of the emulsion. To avoid these errors at least four areas
are scanned in the way described, at different values of $r$.
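The relation between the number of tracks counted and the statistical error can be illustrated in Python. The paper does not state the formula used; the sketch below assumes ordinary Poisson counting statistics, in which the fractional standard error of a count of N tracks is $1/\sqrt{N}$. The function names are illustrative.

```python
import math

def relative_error(n_tracks):
    """Fractional standard error of a count of n_tracks, assuming Poisson statistics."""
    return 1.0 / math.sqrt(n_tracks)

def tracks_needed(target_error):
    """Smallest count whose Poisson fractional error does not exceed target_error."""
    # Rounding guards against floating-point noise before taking the ceiling.
    return math.ceil(round(1.0 / target_error**2, 9))
```

On this assumption a count of 1000 tracks carries a statistical error of about 3 per cent, and halving the error requires four times as many tracks.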

The above methods are satisfactory when the density of tracks is of the order of
200 per mm². For higher densities a × 12 eyepiece is used instead of the usual × 6,
and only those tracks are counted which cross the central region of the eyepiece scale.
For the highest densities the × 12 eyepiece is used in conjunction with a graticule
giving a rectangular grid in the field of view such as is used for blood counts. With the
plate stationary, a map is made of the field of view from which the number of tracks 
starting in a definite area can be counted. The plate is then moved and another area 
is mapped in the same way. This method is very laborious, but the usual method is 
uncertain at high densities because the tracks sometimes cross one another and are 
difficult to disentangle, and it is sometimes useful to be able to deal with areas of 
high density. 

When the scattered particles at a particular angle are not homogeneous, the above 
methods may still be employed if the tracks belonging to the different groups can be 
distinguished by inspection without measuring the length of each one. If, however, 
the range distribution is complex it becomes necessary to measure the length of 
every track starting within a definite area of the plate. Figure 7, for example, shows 
the distribution in energy of the particles from oxygen and nitrogen bombarded 
with 7 MeV deuterons. The two lower-range groups in the first case are sufficiently 
resolved, and can be counted without the measurement of the individual tracks, 
whereas in the second case the length of every track must be determined. 

The nature of the particles producing the different peaks is given in the caption
to figure 7, the identification of the different groups as protons, deuterons and
α-particles having been confirmed by grain counts in a way to be described in a later
paper. The results show the satisfactory resolving power achieved, which appears to
be considerably superior to that obtained with most of the best counter methods
employed hitherto with primary beams of fast deuterons (see, for example,
Humphreys & Watson 1941), and equal to that given by the expansion chamber
(Schultz & Watson 1940).

At large values of r, the angles of incidence of the scattered particles on the 
surface of the emulsion become smaller. As a result there are a number of tracks 
which suffer small-angle deflexions towards the surface and leave the emulsion before 
coming to the end of their range. Such tracks can usually be distinguished in that 
they have both ends in the surface of the emulsion. If the particles are homogeneous 
in energy this effect leads to an apparent increase in the straggling of the range
distribution curve, but does not affect the determination of the intensity of the
scattered beam. The effect is most serious with slow particles and rapidly diminishes 
in importance as the energy increases, so that with the faster particles smaller angles 
of approach can be tolerated. 

At angles of scattering less than ~ 10° the finite thickness of the chamfered edge 
of the tubes B and C leads to a decrease in the effective scattering volume which 
depends on the precise form of the edges of the tubes, and the observed density of 
tracks becomes correspondingly unreliable. With the apparatus in its present form, 
measurements cannot be made, therefore, below 10° and above 170°. 



Figure 7. Particles from the bombardment of the light elements by 7 MeV deuterons:
(a) oxygen at 45°, (b) nitrogen at 105°. The observed distribution in range has been
transformed to a distribution in energy on the assumption that all the particles are protons. The
peaks marked 'd' are due to elastically scattered deuterons. The peaks P₁ and P₂ are due to
protons from the reaction O¹⁶(d, p)O¹⁷, the O¹⁷ being left in the ground and first excited states
respectively. P₃, P₄, P₅ and P₆ are protons from the reaction N¹⁴(d, p)N¹⁵, the N¹⁵ being
left in excited states. The peaks α₁, α₂ and α₃ are α-particles from the reaction N¹⁴(d, α)C¹²,
the C¹² being left in the ground or excited states.


B. TESTS OF THE METHOD 
5. General 

A purely qualitative examination of the plates shows that some of the experimental
difficulties have been overcome. In working with protons we have found no
recoil tracks in the emulsion due to neutrons generated by the large ion currents in
the tank, nor do the associated γ-rays and low-energy neutrons produce any appreciable
increase in the general background of grains against which the tracks have to
be distinguished. In work with deuterons the plate was shielded from neutrons
coming directly from the tank of the cyclotron by boxes of wax placed round the
scattering camera. Nevertheless, we find recoil tracks in appreciable numbers, due,
we believe, to a localized source of neutrons produced by the impact of the deuteron
beam on the brass in the neighbourhood of the stop s₂ (see figure 2). In future
experiments we propose to cover the brass in this region with a foil of a heavy element. The
tracks due to particles scattered by the gas or foil must all start in the surface of the
emulsion and travel in a known direction. Very few of the neutron recoil tracks fulfil
these conditions so that they do not lead to any confusion.

The number of tracks in ‘wrong directions’ due to particles scattered by the gas 
and then by the walls of the camera box is extremely small and gives no difficulty. 

6. Angular variation of the energy of elastically scattered protons 

A satisfactory check on certain aspects of the method is provided by an investigation
of the variation with the angle of scattering of the energy of protons scattered
elastically by the light elements. The energy, E, of particles of initial energy E₀ and
mass m, after being scattered elastically through the angle θ by a nucleus of mass M,
is given by the equation:

\[ (M+m)E - 2m\sqrt{E E_0}\,\cos\theta - (M-m)E_0 = 0. \tag{2} \]

This relation can best be tested by means of the plates showing the scattering of
hydrogen, deuterium and helium. The hydrogen exposure was taken with commercial
hydrogen from a cylinder, the camera being filled to a pressure of 1 atm.
The scattered protons have sufficient energy to penetrate the aluminium foil over
the gap at angles of scattering less than 60° and give measurable tracks below 50°.
The density of tracks on this plate is rather high, so that scattering at small angles
has to be investigated at the greatest possible values of r. This, taken in conjunction
with the high gas pressure and the fact that the gas contained a small quantity of
oxygen, causes an unusually large apparent straggling in the observed range
distribution at small values of θ. Above 25°, however, the proportion of tracks due to
scattering from oxygen falls to less than 20 %, and the measurements between 25
and 50° are satisfactory. The results are shown in figure 8 and are in good agreement
with the relation (2).
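
As an illustration of how the curves of this kind can be computed, the following
sketch solves relation (2) as a quadratic in sqrt(E). It is not taken from the
original work: the function name is hypothetical, the masses are entered as integer
mass numbers (an approximation), and only the larger root is returned, which is the
physical one when M is not less than m.

```python
import math

def elastic_energy(e0: float, m: float, big_m: float, theta_deg: float) -> float:
    """Energy after elastic scattering through theta_deg, from relation (2).

    (M+m)E - 2m*sqrt(E*E0)*cos(theta) - (M-m)*E0 = 0 is a quadratic in sqrt(E);
    its larger root is sqrt(E0)*(m*cos(theta) + sqrt(M^2 - m^2 sin^2(theta)))/(M+m).
    """
    theta = math.radians(theta_deg)
    disc = big_m ** 2 - (m * math.sin(theta)) ** 2
    if disc < 0:
        raise ValueError("angle not kinematically allowed for these masses")
    root_e = math.sqrt(e0) * (m * math.cos(theta) + math.sqrt(disc)) / (big_m + m)
    return root_e ** 2

# Example: 4 MeV protons scattered by hydrogen (M = m) give E = E0 cos^2(theta).
print(elastic_energy(4.0, 1, 1, 45.0))
# Example: the same protons scattered through 90 degrees by oxygen (M = 16).
print(elastic_energy(4.0, 1, 16, 90.0))
```

For equal masses the expression reduces to E = E₀ cos²θ, and for scattering through
90° it reduces to E = E₀(M−m)/(M+m), which provide convenient checks.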

The deuterium used for the first experiments with this gas was very impure,
containing both light hydrogen and oxygen in appreciable quantities. As a consequence
we find in general three resolved groups of tracks at any angle of scattering, θ, of
which the range varies with angle of scattering in the manner shown in figure 9.
The energy of the protons scattered from oxygen varies only slowly with the angle of scattering,
and the line through the experimental points for this group (curve a) follows closely
that predicted by equation (2) if M is put equal to 16. Curve b is drawn from the
results of putting M equal to 2 in equation (2), and the appropriate experimental
points for protons scattered by deuterium lie on this line within the limits of
experimental error. Finally, curve c passes through points corresponding to a third group
of particles. Curve c is calculated on the assumption that these tracks are due to
deuterons thrown forward by the impact of the primary protons, and the agreement
between the observed and calculated values shows that the assumption is correct.



energy in MeV 

Figure 8. The energy and velocity of protons scattered by hydrogen at various angles as 
deduced from the observed range of the scattered particles. Circles give experimental points 
—the lines are calculated from the known primary proton energy. 




energy in MeV 

Figure 9. The variation with angle of the energy of particles from the impact of 4 MeV 
protons on deuterium containing oxygen as an impurity; (a) protons elastically scattered 
from oxygen, (b) protons elastically scattered from deuterium, (c) projected deuterons from 
the impact of protons on deuterium. Circles are experimental points. The lines show the 
expected variation calculated from the known primary proton energy. 







Similar experiments with helium as the scattering gas also give close agreement
between the experimentally observed values and those predicted by equation (2)
when M is put equal to 4.

These measurements demonstrate that the observed tracks arise from single
scattering in the gas and that the effects, on the observed range distributions, of
multiple scattering and of scattering from the edges of stops, are negligible. They
give a very direct check on the range-energy relation for the emulsion, and illustrate
the power of the method by the ease with which the processes responsible for the
different scattered groups can be identified.


7. Variation of track density with distance from the point of scattering

It follows from equation (1) that the density of tracks for a constant value of θ
should vary as 1/r³ when allowance has been made for the slight variation of the
scattering volume with the Y co-ordinate of the point of observation. This relation



Figure 10. Proton-proton scattering. Diagram to illustrate the errors introduced into the 
determination of the scattered intensity due to uneven surface of the emulsion. 

has been verified in a large number of cases. In the case of the scattering of 4 MeV
protons by hydrogen, experiments were made over a particularly large range of r,
from r = 7.6 to r = 18.1 cm. These measurements were made by two observers
using the methods already discussed. The results are shown in figure 10, in which the
corrected value of the quantity r³n/A is plotted against r. It will be seen that the
differences from the mean are rather larger than the expected statistical fluctuations
but do not show systematic change with r.

8. Accuracy of the intensity determinations 

In the measurements represented in figure 10, and in some other cases, we have
found discrepancies between the results at a given value of the angle of scattering,
but at different values of r, which are larger than would be expected from statistical
fluctuations. We ascribe this to irregularities in the surface of the emulsion. If, owing
to these irregularities, the surface at any point is inclined at an angle δ then the
particles impinge at a glancing angle (h/r) + δ on the surface. The intensity deduced
from the counts, on the assumption that this angle is h/r, will be in error by a fraction
δr/h. With δ = ± |° this leads to errors of 20 % at 20 cm. and 10 % at 10 cm.

This is about the extreme discrepancy experienced. The error introduced is pro¬ 
portional to r and is shown in figure 10 graphically. To reduce this source of error, 
measurements of intensity are always taken at several different values of r, but the 
probable error of the mean result is still considerably greater than it would be from 
statistical fluctuation alone. 
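
The linear growth of this error with r can be illustrated by a short sketch of the
expression δr/h. The default height h = 1.45 cm is borrowed from the typical value
of h₀ quoted in Appendix 1 and is an assumption here, as are the function name and
the tilt used in the example.

```python
import math

def tilt_error(delta_deg: float, r_cm: float, h_cm: float = 1.45) -> float:
    """Fractional intensity error delta*r/h for a surface tilt delta (degrees).

    h_cm defaults to the typical h0 of Appendix 1 (an assumed value).
    """
    return math.radians(delta_deg) * r_cm / h_cm

# Hypothetical tilt of half a degree, evaluated at two distances.
for r in (10.0, 20.0):
    print(r, tilt_error(0.5, r))
```

Doubling r doubles the error, which is why measurements at small r are the more
reliable when surface irregularity is the main source of error.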



Figure 11. Elastic scattering of 4 MeV protons from (a) iodine and (b) argon, expressed at
any angle as a ratio of the observed intensity to that expected from the Rutherford
relationship. Abscissa: angle of scattering in degrees.

Where measurements have been made over a wide range of r and where this is the
main source of error, the observations should be given a weight proportional to
1/r² in computing the mean. This leads to a slightly different mean from that deduced
by weighting the observations according to the number of tracks counted. These two
means are shown in figure 10. In most of our observations the range of r employed
is not so great, and a simple mean of the results at different distances may be used.
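
The two ways of forming the mean can be compared with a minimal sketch. The
function names and the numbers in the test are hypothetical and are not the data of
figure 10; the 1/r² weighting follows from an error proportional to r.

```python
def weighted_mean(values, weights):
    """Weighted arithmetic mean of values."""
    total = sum(weights)
    return sum(v * w for v, w in zip(values, weights)) / total

def mean_weighted_by_distance(values, r_values):
    """Mean with weights 1/r^2, suitable when the error is proportional to r."""
    return weighted_mean(values, [1.0 / r ** 2 for r in r_values])

def mean_weighted_by_counts(values, counts):
    """Mean with weights equal to the number of tracks counted."""
    return weighted_mean(values, counts)
```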



A very satisfactory test of the accuracy of the determination of the intensity of
the scattered beam and of the general performance of the apparatus is provided by
experiments on the scattering of protons and deuterons by the heavy elements,
where we must expect the intensity of the scattered beam to follow the Rutherford
relationship, I ∝ cosec⁴ θ/2.

Figure 11 shows the results of the experiments on the scattering of 4.0 MeV
protons by iodine in the form of CH₃I vapour, and by argon. The observed intensity
of the scattering at any angle is plotted as a ratio to that expected if the variation
in the intensity followed the relation I = K cosec⁴ θ/2 precisely. The results were
obtained with an earlier form of the camera in which there was no provision for
observations at angles of scattering greater than 100°. The scattering is Coulombian
in the range examined within the limits of statistical error. The reliability of the
method is thus confirmed.
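
The ratio plotted in a comparison of this kind can be formed as in the following
sketch. The function names are illustrative, and the constant K is treated as a free
normalization, which is an assumption about how the comparison is made.

```python
import math

def rutherford_shape(theta_deg: float) -> float:
    """Angular factor cosec^4(theta/2) of the Rutherford relationship."""
    return 1.0 / math.sin(math.radians(theta_deg) / 2.0) ** 4

def ratio_to_rutherford(theta_deg: float, observed: float, k: float = 1.0) -> float:
    """Observed intensity divided by K*cosec^4(theta/2)."""
    return observed / (k * rutherford_shape(theta_deg))
```

A ratio that stays at a constant value over the whole angular range indicates
Coulomb scattering.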


9. Angular resolving power of the apparatus and accuracy
of determination of the angles of scattering

The angular resolving power of the apparatus as calculated from the geometry
of the arrangement varies with both r and θ, and is given to a first approximation
by the equation

A8 — -radians. (3) 

r 

This gives Δθ = 20' at r = 20 cm., θ = 15° and Δθ = 3.4° at r = 5 cm., θ = 90°.
It is satisfactory that the resolving power is greater when θ is small and the variation
of the intensity with θ most rapid. As is shown in Appendix 1 (c) these values of the
resolving power are not attained because of the divergence of the primary beam and
of the small-angle secondary scattering of the particles by the gas filling the camera,
as they move from the gap to the emulsion. A very simple and direct demonstration
of the resolving power attained in practice is provided by experiments on the
scattering of deuterons by protons.

It is easy to show that when a deuteron beam is scattered by hydrogen, the
deuterons cannot be scattered through angles greater than 30°. We therefore expect
to find a group of particles which disappears at this angle. The details of the intensity
relations will be discussed in a later paper, and it will be shown that for an ideally
sharp angular resolving power we should expect a variation of intensity of this
group with angle of the type shown in curve c, figure 12. In fact we observe a variation
according to curve b. This smearing out of the sharp features of the angular
distribution corresponds to a finite angular resolving power of the order of ± 2°, about
half of which is due to small-angle scattering in the gas. This is amply sufficient for
the resolution of detail occurring in the angular distribution of scattered particles
from other elements, although it may lead to errors in intensity measurements when
the intensity is varying rapidly with angle as in Rutherford scattering at small
angles. This is discussed in Appendix 2 (e).
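
The limiting angle of 30° follows from the standard kinematic result that a particle
of mass m scattered elastically by a lighter nucleus of mass M at rest cannot be
deflected through more than sin⁻¹(M/m). A minimal sketch, with an illustrative
function name and integer mass numbers as an approximation:

```python
import math

def max_scattering_angle(m: float, big_m: float) -> float:
    """Largest laboratory scattering angle, in degrees, for a projectile of
    mass m striking a nucleus of mass big_m initially at rest."""
    if m <= big_m:
        return 180.0  # all angles are allowed when the target is not lighter
    return math.degrees(math.asin(big_m / m))

# Deuterons (m = 2) scattered by hydrogen (M = 1).
print(max_scattering_angle(2, 1))
```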





10. General remarks 

As a result of our experience in the course of the present work, we have made 
certain minor improvements in the apparatus. Experiments with deuterons, to be 
described subsequently, showed the importance of making experiments at a number 
of different values of the energy of the primary beam using any particular scattering 



Figure 12. Distribution in angle of the intensity of particles from the impact of 7 MeV
deuterons on hydrogen. (a) Projected protons. (b) Scattered deuterons as observed. (c) Expected
distribution of scattered deuterons as calculated from the observed proton distribution
assuming infinite resolving power. The discrepancy between the observed and calculated
deuteron distributions is due to the failure to observe some of the deuterons of which the
residual range in the emulsion is small and to the finite resolving power of the apparatus.


gas. We have therefore made provision for the introduction of stopping foils into the 
primary beam so that the primary energy can be immediately changed to any one 
of a number of values. 

In the apparatus as described, the scattering gas fills the whole camera and is not
recovered after the exposure. For experiments with rare gases, such as separated
isotopes, provision must be made for the recovery of the gas, and in general a
sufficient quantity to fill the apparatus to the required pressure is not available.
It is therefore desirable to replace the aluminium foil hitherto used to cover the gap,
g, by a mica window so that the collision space between the tubes B and C can be
made gas-tight. In order to avoid pressure differences between the two sides of the
mica, it is intended to fill the camera with a common gas such as hydrogen at the
same pressure as the rare gas filling the collision space.

The new cyclotrons now in course of construction will yield deuteron beams with 
an energy of the order of from 50 to 100 MeV. One must expect new and unforeseen 
problems and difficulties in passing from experiments with 10 MeV to those with 
100 MeV particles, but it seems reasonable to assume that the methods described 
may still be applicable, without major modifications, at considerably higher energies 
than those of which we have experience. 

Two of us (A. N. M. and C. F. P.) have pleasure in acknowledging our indebtedness
to Professor A. M. Tyndall for placing at our disposal the facilities of the
H. H. Wills Physics Laboratory, University of Bristol, where the construction of
the camera and the measurements on the plates were carried out.


APPENDIX 1 

A MORE ACCURATE THEORY OF THE SCATTERING CAMERA 

(a) Effect of the finite cross-section of the beam 

The beam is assumed to be accurately parallel to the common axis of the tubes
B and C. An element of the beam at a height h above the plate contributes a number

h 

of tracks at the point of observation A which is proportional to F = —p'/p in the 

notation on p. 8. Introducing co-ordinates X and Y in the surface of the plate as 
shown in figure 4 we have, 


and 


*■ =/W(-£ 2 +p 2 )=pv< i+-£ 2 /p 2 n 


(1 +X 2 jp 2 )i pp' 2 ' 


( 1 ) 


Now we introduce co-ordinates r, φ, fixing the position of an element of the beam
relative to the axis of the scattering tube (figure 13). Then using the suffix 0 to
designate the value of any quantity for an element on the axis, it is easily shown
that to the second order in r,


hlh= l+rsmteS-a), p' tpi _ 


( 2 ) 




where α = tan⁻¹(h₀/Y₀), R is the internal radius of the scattering tube, and θ is the
angle of scattering. Inserting these values in (1) and developing as far as the terms
in r² we obtain

F/F 0 = 1+T- sm (^-a)-| - r-^- + 7 — jsm6 — a cos 6 

“'O Po h 0 p 0 

+ (4 C°s 2 ^ - 1)+(3 cos 2 5 - 1) sin 2 0, 

Po ^Po 

where θ is the angle of scattering.



Figure 13. Cross-section of scattering tube and photographic plate. 


If we average this over the cross-section of the beam then the first order terms
depend on the distance of the centre of gravity of the beam from the axis. The first
term is independent of the co-ordinates X, Y of the point of observation on the
plate and therefore does not affect relative intensity determinations. The second
depends on ρ'₀ which varies from 5 to 10 cm., so that the extreme error in relative
intensity measurements will be 2ξ/100, where ξ is the displacement of the centre of
gravity of the beam from the axis in mm. This reaches 2 % if ξ = 1 mm.

To investigate the effect of the second order terms we assume that the beam is
accurately centred on the axis, is circular in cross-section, and of uniform intensity
across the section. Then we average first with respect to φ and then r.

If we average with respect to φ, then the first order terms in r disappear and the
second order terms become


7 2 t 2 3cos 2 #—1 


ysma + ^-h- 

Po ■“'Po 


Now average r between 0 and T, the radius of the beam, and we get 


™ T % T 2 

FIF 0 =l- 7 ^jsma+- 


T z 3 cos 2 (9-1 


8 


(3) 


2Vo Wo 

h o' 

Here F n = -4 —, and therefore includes the corrections for the variation of the 
rlp 0 

scattering volume with Y.




The correction terms in (3) are small. Thus taking typical values, T = 0.2,
R = 0.75, h₀ = 1.45, ρ₀, ρ'₀ = 10, all in cm., the correction terms would not exceed
0.2 % and may therefore be safely neglected.

For large values of θ the scattering occurs at a considerable distance from the
defining stop so that the beam may be appreciably broadened. Suppose we take
T = 0.5 cm., then the correction terms amount to 1.2 % at r = 10 cm., and 2.4 % at
r = 5 cm. This is small enough to be neglected.

(b) Effect of the finite width of the gap g 

The above calculation was made for an infinitely narrow gap. For a fixed point of
observation the different parts of a gap of finite width will give different values of X
in equation (1) and we have to replace the value of F for the centre of the gap by
the mean value over the width, s. This will change the observed intensity by a factor

X 2 \ * 1 f f5/2 dx s 2 p 2 - 4X 2 

\ 1+ 7) *J-^ X , sy+x*) 2 

o2 

^ 1 4 . ___ (4 C os 2 # — 1)^ 1 -f 10~ 3 at r = 10 cm., s = 0*5 cm. 


(c) Angular resolving power 

There are several causes of slight variations of the angle of scattering at a fixed
point of observation on the plate. First the angular divergence of the primary beam
is 1° as given by the geometry of the stop system. It will be rather more than this
on account of small-angle scattering in the mica windows and in the gas, but this
effect is small. Secondly, the width of the gap g gives a spread in angle of the scattered
beam given by

\[ \Delta\theta = \frac{s}{r}\sin\theta. \tag{4} \]


Besides this, however, there is a variation in θ which is due to the finite diameter
of the primary beam, and would be present even with an infinitely thin gap. We have
ρ/X = tan θ. Now ρ varies across the beam according to equation (2). If η is the
extreme value of r sin φ this leads to an extreme variation of θ from the value on
the axis given by

\[ \Delta\theta = \frac{\eta^2}{2XR}\cos^2\theta = \frac{\eta^2}{2rR}\cos\theta. \]

This will be less important than (4) at all angles greater than θ_c ~ η²/2sR ~ 2°.
This effect can therefore be neglected.

Finally, we must consider small-angle scattering in the gas between the scattering
volume and the point of observation. The root mean square angle of scattering is
given in degrees by Z√(PL)/2E, where Z is the atomic number of the gas, P the
pressure in atmospheres, L the length traversed in cm., and E the energy of the
particle in MeV. This leads to values of the order of 1°.
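
The expression for the root mean square angle, read here as Z√(PL)/2E, can be
evaluated with a short sketch. The function name and the example values are
illustrative and are not taken from a particular exposure.

```python
import math

def rms_scatter_deg(z: float, p_atm: float, l_cm: float, e_mev: float) -> float:
    """Root mean square small-angle scattering in degrees, Z*sqrt(P*L)/(2E)."""
    return z * math.sqrt(p_atm * l_cm) / (2.0 * e_mev)

# Hypothetical case: hydrogen (Z = 1) at 1 atm, 16 cm of path, 4 MeV particles.
print(rms_scatter_deg(1, 1.0, 16.0, 4.0))  # 0.5
```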




APPENDIX 2 

(a) Variations in the energy of the primary beam corresponding 
to different angles of scattering 

In experiments with gases the mean point of scattering corresponding to any
particular angle of scattering, θ, varies with θ. Owing to the stopping power of the
gas filling the camera, it follows that there is a slow decrease in the energy of the
primary beam at the mean point of scattering as θ increases. Even in unfavourable
cases, however, this variation is small, and it can be made negligible, when desired,
by reducing the pressure of the gas in the camera and increasing the exposure time.

To take a particular example, in one exposure with acetylene at a pressure of
30 cm. of mercury in the camera, the energy of the deuteron beam on the vacuum
side of the mica windows (see figure 3) was 7.4 MeV. The energy at the mean point of
scattering varied from 6.7 MeV for θ = 15° to 6.4 MeV for θ = 165°, a range of
0.3 MeV. For this experiment the exposure time was 8 min. with 5 μA in the main
cyclotron beam reaching the collecting plate (see figure 1). In similar conditions
with 10 cm. of gas in the camera, the exposure time necessary to give the same
density of tracks would have been 24 min., and the variation in the primary energy
from 7.06 MeV for θ = 15° to 6.96 MeV for θ = 165°, a range of 0.1 MeV. In work
with deuterons an even greater reduction in the pressure could have been made
without leading to an appreciable number of neutron recoils in the emulsion, and
in work with protons in this range of energy neutron disturbance is negligible. The
necessary exposure times could of course be reduced by an increase in the main
currents in the cyclotron.

(b) Penetration of the edges of the gap 

This would give rise to a tail on the short-range side of any peak in the range
distribution. The edges of the gap are of course not quite sharp, the tubes B and C
being feathered externally at an angle of 5° to a thickness of about 0.01 cm. This
corresponds to about 40 cm. of air, or 27 microscope divisions. This is sufficient to
stop all scattered particles and most of the disintegration protons. Nevertheless,
at angles of observation other than 90° a certain amount of penetration will occur
in the way shown in figure 14. Particles which enter the brass at a distance x cm.
from the edge will go through a thickness x/cos θ cm. of brass, a thickness equivalent
to 2700x/cos θ microscope scale divisions. Thus if N particles go through the gap of
width 5 mm. and form the main peak of the distribution, the particles which pass
through the edge of the gap will give a flat tail with a height of 4N cos θ/2700 tracks
per microscope division. Thus a peak containing 700 tracks would have a tail with
one track per microscope division. The observed tail is more intense than this owing
to straggling in the emulsion and mismeasurement of a few tracks.
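
The height of the tail, read here as 4N cos θ/2700 tracks per microscope division,
can be checked against the figure of one track per division for a peak of 700
tracks. The function name is illustrative, and the factor 4 is taken to combine the
2N particles per cm. of a 5 mm. gap with the two edges, which is an interpretation
of the text rather than a statement in it.

```python
import math

def tail_height(n_peak: float, theta_deg: float, divisions_per_cm: float = 2700.0) -> float:
    """Tracks per microscope division in the flat tail from edge penetration,
    4*N*cos(theta)/2700, for a gap of width 5 mm."""
    return 4.0 * n_peak * math.cos(math.radians(theta_deg)) / divisions_per_cm

# With cos(theta) close to unity, a 700-track peak gives about one track per division.
print(tail_height(700, 0.0))
```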

Particles with an initial range greater than 40 cm. of air will be able to penetrate
the scattering tubes over a greater area. For them a similar calculation
gives a tail of height 4N sin θ/270 tracks per microscope division. This will stretch
from a range 27 microscope divisions less than that of the main peak down to
zero range.



(c) Inhomogeneity and fluctuation in the energy of the primary beam 


This would show up in our experiments as a broadening of the elastically scattered
peak. We have not observed any such effects in experiments with 4 MeV protons and
conclude that the primary beam is homogeneous and constant in energy within the
resolving power of the method in this case. With 7 MeV deuterons there is evidence
for a spread in energy of the primary beam of about 0.5 MeV.

(d) Effect of stray magnetic field at the scattering camera 

For convenience in working the scattering camera was placed in a horizontal plane.
The vertical component of the stray magnetic field from the cyclotron will therefore
cause a curvature of the tracks in this plane. Measurements of the stray field were
made with a search coil and these show that in the region of the camera the stray
magnetic field is about 1/20 of the main field. Since the latter gives a radius of curvature
of 40 cm. in the tank, the radius of curvature of elastically scattered particles in the
camera will be 8 m. Disintegration particles, or inelastically scattered particles,
will have a greater or less curvature corresponding to their values of Hρ.

Now the length of path of the scattered particles in the camera is of the order of
10-20 cm. and in this distance the above curvature will lead to the particle turning
through an angle of about 1°. The error in the assumed angle of scattering will be ½
of this, or 0.5°, which is less than the error in angle arising from other causes.
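
The arithmetic of this estimate can be set out in a short sketch. It assumes that
the radius of curvature scales inversely with the field for particles of the same
Hρ, and that the stray field is one-twentieth of the main field, as implied by the
radii of 40 cm. and 8 m.; the function names are illustrative.

```python
import math

def camera_radius_cm(tank_radius_cm: float, field_ratio: float) -> float:
    """Radius of curvature in the stray field, for the same value of H*rho."""
    return tank_radius_cm / field_ratio

def turning_angle_deg(path_cm: float, radius_cm: float) -> float:
    """Angle turned through over a path of given length on a circle of given radius."""
    return math.degrees(path_cm / radius_cm)

radius = camera_radius_cm(40.0, 1.0 / 20.0)   # about 800 cm
print(turning_angle_deg(15.0, radius))        # about 1 degree for a 15 cm path
```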

(e) Errors in the measurement of intensity at small angles 

(i) Errors arising from bad alinement. If the axis of the beam does not coincide
with the axis of the scattering tubes, or if from any other cause there is an error Δθ
in the determination of the angle of scattering θ, the rapid variation of intensity with
angle at small angles will give rise to serious errors in the apparent angular
distribution. Thus for Rutherford scattering I = const. cosec⁴ ½θ, so that

\[ \frac{\Delta I}{I} = 2\cot\tfrac{1}{2}\theta\,\Delta\theta = \frac{4\,\Delta\theta}{\theta} \]

at small angles. This would amount to a 40 % error at 20°, for an error of 2° in the
measurement of θ. The best test of this type of error is the satisfactory agreement
found with Rutherford scattering.
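
The size of this error can be evaluated directly from the Rutherford relation, both
exactly and in the small-angle form 4Δθ/θ. The function names in this sketch are
illustrative.

```python
import math

def rutherford_error_exact(theta_deg: float, dtheta_deg: float) -> float:
    """Fractional intensity error 2*cot(theta/2)*dtheta for I ~ cosec^4(theta/2)."""
    half = math.radians(theta_deg) / 2.0
    return 2.0 * math.radians(dtheta_deg) / math.tan(half)

def rutherford_error_small_angle(theta_deg: float, dtheta_deg: float) -> float:
    """Small-angle approximation 4*dtheta/theta."""
    return 4.0 * dtheta_deg / theta_deg

print(rutherford_error_exact(20.0, 2.0))        # close to 0.40
print(rutherford_error_small_angle(20.0, 2.0))  # 0.4
```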

(ii) Errors from finite resolving power. A similar but smaller effect will occur
because the tracks at any point of observation have been scattered through a finite
range of angles from θ − Δθ to θ + Δθ. Again assuming Rutherford scattering this
will give rise to an error of the order 4(Δθ/θ)² at small angles. For Δθ = 2° this
amounts to 2.5 % error at 20°.

(iii) Finite thickness of the feathered edge, and bad alinement of tubes B and C. If
t is the thickness of the edges of the tubes at the gap, then the effective length of the
gap is reduced by an amount t cot θ for particles passing through the axis of the tube
and rather more for others. This introduces an error t cot θ/s.

If the axes of the tubes B and C are parallel but displaced by a distance t' in the
horizontal plane, then the above formula will hold with t changed to t ± t', the sign
to be taken changing at θ = 90°. If t, t' are assumed to be about 0.1 mm. this gives
of the order of 5 % error at 20°.
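
The order of magnitude quoted can be reproduced from the expression t cot θ/s with
a minimal sketch. The gap width s = 0.5 cm. is the value given elsewhere in the
appendices; the function name is illustrative.

```python
import math

def edge_error(t_cm: float, s_cm: float, theta_deg: float) -> float:
    """Fractional reduction of the effective gap length, t*cot(theta)/s."""
    return t_cm / (s_cm * math.tan(math.radians(theta_deg)))

# t = 0.1 mm, s = 5 mm, theta = 20 degrees gives roughly 5 per cent.
print(edge_error(0.01, 0.5, 20.0))
```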


References 

Bower, J. C. & Burcham, W. E. 1939 Proc. Roy. Soc. A, 173, 379.

Humphreys, R. F. & Watson, W. W. 1941 Phys. Rev. 60, 542.

Kuerti, G. & Wilkins, T. R. 1940 Phys. Rev. 57, 1081.

Pollard, E. & Watson, W. W. 1940 Phys. Rev. 58, 12.

Powell, C. F. 1940 Nature, Lond., 145, 155.

Powell, C. F. 1943 Proc. Roy. Soc. A, 181, 344.

Powell, C. F. & Fertel, G. E. F. 1939 Nature, Lond., 144, 115.

Powell, C. F., May, A. N., Chadwick, J. & Pickavance, T. G. 1940 Nature, Lond., 145, 893.

Schultz, H. L., Davidson, W. L. & Ott, L. H. 1940 Phys. Rev. 58, 1043.

Schultz, H. L. & Watson, W. W. 1940 Phys. Rev. 58, 1047.

Wilkins, T. R. 1941 Phys. Rev. 60, 365.

Wilkins, T. R. & Kuerti, G. 1939 Phys. Rev. 55, 1134.

Wilkins, T. R. & Kuerti, G. 1940 Phys. Rev. 57, 1083.



Long duration of the Balmer spectrum 
in excited hydrogen 

By Lord Rayleigh, F.R.S. 

(Received 19 November 1943)

[Plate 1] 

The light of a condenser discharge through hydrogen, emitting the Balmer series of lines,
persists for much longer than current theories and experimental determinations of the time
of relaxation of the hydrogen atom would lead one to anticipate. These determinations
indicate a time of the order of 10⁻⁸ sec. for the intensity to diminish in the ratio e : 1. It is shown
in the present paper that in fact, if hydrogen made luminous by a powerful discharge is blown
out of the electric field by its own expansion, the time in question is under some conditions a
thousand times more. The discrepancy is not cleared up, but the aim has been to present the
evidence as clearly as possible for criticism.


This paper deals with the question of how long the luminosity of hydrogen (Balmer 
spectrum) lasts after the electric discharge has ceased to pass through the gas. 

Results obtained with metallic vapours (Strutt 1914 a, b) suggest that in their
case the duration is much longer than most theoretical discussions of the subject
would lead one to anticipate, since mercury vapour, for example, can be made to
pass in the luminous condition down long tubes, and it can be shown that the same
applies to other metallic vapours.

Theoretical views as to what hydrogen ought to do are much more definite than
in the case of other elements because of the comparative simplicity of the case from
the standpoint of wave mechanics. In the present paper the behaviour of hydrogen
is examined by experiment. Earlier experiments had led to the doubt whether the
usual view could be sustained (Strutt 1917), and the present more definite investigation
has confirmed the earlier one. The experiments appear to indicate definitely
enough that in this work the fall in intensity in the ratio e: 1 occupies not 10⁻⁸ sec.,
as found in other investigations, but 10⁻⁵ sec. So enormous a factor (1000 times)
shows clearly that the discrepancy is fundamental, and cannot be met by any
criticism on points of detail.

A summary by S. Dushman of the literature on this subject will be found in Taylor
& Glasstone (1941). The first and in some respects the most direct experiments on
this subject (Wien 1919, 1921) were made by taking advantage of the luminosity of
the canal rays, and their velocity measured by magnetic and electrostatic deviation.
They gave the values 2.3 × 10⁻⁸ sec. for Hβ and Hγ. Slack (1926) has determined
the duration of the 2P state in hydrogen, using excitation with a voltage of about
10.2 which could only excite the first line of the Lyman series. Without going into
details of his method, it may be said that the radiation was observed photoelectrically,
and that its duration was brought into comparison with the time of
oscillation of a high-frequency circuit of 10⁷ cyc./sec. He concluded that this
intensity fell in the ratio e: 1 in 1.2 × 10⁻⁸ sec.

Condon & Shortley (1935, p. 135) have given a quantum-mechanical calculation
of the duration of the various components of the Hα line. This line is a complex of
seven components. These authors give the mean lives (the reciprocals of the total
transition probabilities) of each level of the three initial configurations as:

configuration      mean life
3s                 16 × 10⁻⁸ sec.
3p                 0.54 × 10⁻⁸ sec.
3d                 1.56 × 10⁻⁸ sec.


I have not attempted to follow in detail the complex train of reasoning by which
this result is arrived at. It is in general agreement with the time (10⁻⁸ sec.)
calculated by the classical (Hertzian) method (Lorentz 1909) and seems to be strongly
supported by the general success of the quantum-mechanical treatment of spectra.
This quantum theory points clearly to periods not much more than 10⁻⁸ sec., and
such experimental evidence as is available points to about the same result.



The general plan of the present experiments is as follows: The electrodeless
discharge is produced by induction in a tube having the form of a square ring. This
tube had an annexe or side tube coming out at right angles to one side and of the
same bore, through which it was exhausted. The tube is vertical in figure 1.
Each discharge is of very short duration and of high energy, capable of heating
the small mass of rarefied gas to a high temperature with corresponding rise of
pressure. The gas squirts out into the vertical exit above mentioned, out of the path
of the current, remaining luminous (figure 2, plate 1). It there shows the three
hydrogen lines Hα, Hβ and Hγ. By observing the vertical exit tube with a revolving
mirror the upward progress of the luminosity can be traced and its velocity
determined, and the decay of luminosity at various distances along this tube can be
measured independently. These data make it possible to determine the time in which
the initial luminosity of the gas decays in any given ratio.

In detail the arrangements are these. The discharge without electrodes is induced
in an endless tube about 13 cm. square and 1.5 cm. bore (figure 1). Inside this is the
primary (exciting) coil consisting of 12 turns or less of thick copper wire, each 10 cm.
square. This coil is earthed in the middle. Two condensers and a spark gap are in
series with it, the spark gap being connected to a high-potential transformer. The
sparks are at the rate of 100 per sec., which is the doubled frequency of the
alternating supply. The spark length between balls was 8.5 mm., corresponding to say
28 kV. The aggregate capacity of the two condensers in series was 7.25 × 10⁻³ μF.
This makes the energy of charge 2.94 × 10⁷ ergs; thus the output of the condensers
is 294 W. The power dissipated in the ring discharge was measured by immersing
the discharge tube in an oil calorimeter. It was found to be about 100 W, say 0.3 of
the output of the condensers. The rest, no doubt, goes almost entirely into the spark
in the condenser circuit. The 100 W represents 1 joule or 0.24 cal. at each discharge.
The discharges are, however, of very short duration, with relatively long intervals
between. The duration may be examined by means of a rotating mirror, applied
either to the spark gap in the primary exciting current, or to the low-pressure
discharge.
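The energy budget just described can be checked with a short sketch. It assumes the usual expression ½CV² for the energy of charge, which the text does not state explicitly; with the rounded potential of 28 kV this gives a figure a few per cent below the 2.94 × 10⁷ ergs printed, as is to be expected when the potential is quoted as "say 28 kV". The remaining figures are taken as printed.

```python
# Condenser figures quoted in the text.
CAPACITY_FARAD = 7.25e-3 * 1e-6   # 7.25 x 10^-3 microfarad
SPARK_VOLTS = 28e3                # "say 28 kV" (rounded in the text)
DISCHARGES_PER_SEC = 100
ENERGY_QUOTED_JOULE = 2.94        # 2.94 x 10^7 ergs, as printed
RING_POWER_WATT = 100.0           # oil-calorimeter measurement

def stored_energy_joule(capacity_farad, volts):
    """Energy of charge, assuming the standard formula (1/2) C V^2."""
    return 0.5 * capacity_farad * volts ** 2

energy_from_formula = stored_energy_joule(CAPACITY_FARAD, SPARK_VOLTS)
condenser_output_watt = ENERGY_QUOTED_JOULE * DISCHARGES_PER_SEC
fraction_in_ring = RING_POWER_WATT / condenser_output_watt
joule_per_discharge = RING_POWER_WATT / DISCHARGES_PER_SEC
cal_per_discharge = joule_per_discharge / 4.184

print(f"energy of charge from (1/2)CV^2: {energy_from_formula:.2f} J")
print(f"output of condensers: {condenser_output_watt:.0f} W")
print(f"fraction reaching the ring discharge: {fraction_in_ring:.2f}")
print(f"per discharge: {joule_per_discharge:.1f} J = {cal_per_discharge:.2f} cal.")
```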

Each discharge, as will be seen, occupies only a very short space of time, and the
momentary rise of temperature in the gas can be calculated from the measured
energy of a discharge combined with a knowledge of the heat capacity. Thus, taking
the volume of the tube as 79 c.c. and the hydrogen pressure as 0.2 mm., the mass of
the gas is 1.88 × 10⁻⁶ g. The specific heat may be taken as 3.4 cal./g. and the thermal
capacity will be 6.4 × 10⁻⁶ cal. Thus one discharge will cause a rise of 0.24/6.4 × 10⁻⁶
degree or 3.76 × 10⁴ degrees. A great instantaneous increase of pressure is therefore
to be expected and the expansion producing the luminous jet is amply accounted for.
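The estimate of the temperature rise may be retraced as follows. The sketch assumes the gas to be at 0° C when its mass is reckoned, and uses a handbook density for hydrogen at 0° C and 760 mm.; neither figure is given in the text. On these assumptions the printed mass, thermal capacity and temperature rise are reproduced to within about 1 per cent.

```python
# Quantities stated in the text.
TUBE_VOLUME_CC = 79.0
PRESSURE_MM = 0.2
SPECIFIC_HEAT_CAL_PER_G = 3.4
HEAT_PER_DISCHARGE_CAL = 0.24

# Assumption (not in the text): density of H2 at 0 deg C and 760 mm., in g per c.c.
H2_DENSITY_STP = 8.988e-5

def gas_mass_g(volume_cc, pressure_mm):
    """Mass of hydrogen in the tube, scaling the standard density by pressure."""
    return volume_cc * H2_DENSITY_STP * pressure_mm / 760.0

mass = gas_mass_g(TUBE_VOLUME_CC, PRESSURE_MM)
thermal_capacity = mass * SPECIFIC_HEAT_CAL_PER_G
temperature_rise = HEAT_PER_DISCHARGE_CAL / thermal_capacity

print(f"mass of gas: {mass:.3g} g")
print(f"thermal capacity: {thermal_capacity:.3g} cal. per degree")
print(f"rise per discharge: {temperature_rise:.3g} degrees")
```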

It is possible that some who have had experience of the perverse way in which
condenser discharges go to earth by any path except where the experimenter wishes,
may be suspicious that the prolongation of the luminosity into side vessels, as in
figure 2 (plate 1), is not a genuine effect of transport of a stream of gas having
residual luminosity, but that it represents an electric current gone astray to earth.
For many reasons I am satisfied that this is not the case. Such stray discharges often
occur if a single condenser only is used. The stray discharge is in part superposed on
the genuine effect of the jet of luminous gas. They are, however, easily distinguished
by several criteria. Stray discharges show a complex spectrum, the so-called
secondary (molecular) spectrum of hydrogen, whereas the luminous jets only show
the Balmer lines on a black background, and are dominated by the fiery red of the
Hα line. They are not more than 6 cm. in length. If the channel divides, as in figure 3,
the fiery luminous jets divide also (figure 4, plate 1), but the stray discharge goes
down on one side only to the air pump, which forms in effect an earthed electrode.
It also extends through any length of tubing, in contrast to the luminous jets, which
only go a short distance.

The best way of getting rid of these effects is to use Lodge's arrangement with two
condensers in series with the spark gap, and the exciting coil in the other
symmetrical position between the condensers. The exciting coil may be earthed at the
middle, though this is scarcely necessary. The stray discharges then disappear, but
the genuine effect remains.

If the light from the spark gap is examined, a long train of oscillations is found,
with little damping, the period being (with one of the arrangements used)
1.29 × 10⁻⁶ sec. When the secondary discharge started, the long train of regular
oscillations was reduced to 3 or 4 faintly discernible, by far the greater part of the
light being in the first discharge or pilot spark.



If the light of the induced discharge using nitrogen in the square discharge tube
is examined, then 1.93 × 10⁻⁶ sec. of total duration, or about 1½ periods, is found.
If hydrogen is used instead of nitrogen a complication ensues. In hydrogen the light
is persistent owing to the gas remaining luminous, and does not go down to nothing
at the minima of the oscillating current. No oscillations can in fact be detected, and
if the mirror turns very fast, as is necessary to resolve the oscillations with nitrogen,
the light in hydrogen is diluted to the vanishing point, and there is nothing to be
seen or photographed. By using a rapid (f/1.9) lens and a larger mirror a luminous
trail can be photographed. The initial image with the mirror turning slowly (merely
to separate the individual discharges) was 0.1 mm. wide. By turning the mirror at
29 rev./sec., the image was expanded and was traceable for a length of 0.5 mm.,
representing 1.8 × 10⁻⁵ sec. The images (enlarged about 23 times) are represented
in figures 5 and 6 (plate 1).

This experiment, only carried out with difficulty, seems to give some indication
of a discontinuity of intensity, the later decay being slower. However, it shows
clearly enough that the duration is much more than can be reconciled with the
received view that the time is of the order 10⁻⁸ sec. It is generally confirmatory of
the more satisfactory experiment described below.

A point of interest is the very high instantaneous value of the power dissipated in
the discharge tube. Taking the duration of each discharge of the condenser as
1.93 × 10⁻⁶ sec., and the number of discharges as 100 per sec., the discharges occupy
only 1.93 × 10⁻⁴ of the whole time: and the power dissipated during the progress of
an individual discharge is 100/(1.93 × 10⁻⁴) W, or 520 kW.

This enormous figure for the power dissipated in a vacuum discharge tube excited
in this way came as a surprise to me, and is perhaps not generally realized. It helps
to account for the distinctive character of condenser discharges in spectroscopy.
Since the duration of 1.93 × 10⁻⁶ sec. includes 1½ complete oscillations, it is evident
that the true maximum power must be several times that calculated.
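The arithmetic of the instantaneous power is brief enough to set out in full. The sketch uses only figures quoted in the text (the mean power of 100 W, the duration of a discharge, the rate of 100 per sec. and the period of oscillation); the variable names are merely illustrative.

```python
MEAN_POWER_WATT = 100.0            # calorimeter value for the ring discharge
DISCHARGE_DURATION_SEC = 1.93e-6   # total duration of one induced discharge
DISCHARGES_PER_SEC = 100
OSCILLATION_PERIOD_SEC = 1.29e-6   # period of the primary oscillations

# Fraction of the whole time during which a discharge is actually passing.
duty_fraction = DISCHARGE_DURATION_SEC * DISCHARGES_PER_SEC

# Mean power while a discharge is in progress.
power_during_discharge = MEAN_POWER_WATT / duty_fraction

# Number of complete oscillations contained in one discharge.
periods_per_discharge = DISCHARGE_DURATION_SEC / OSCILLATION_PERIOD_SEC

print(f"fraction of time occupied: {duty_fraction:.3g}")
print(f"power during a discharge: {power_during_discharge / 1e3:.0f} kW")
print(f"oscillations per discharge: {periods_per_discharge:.2f}")
```

The figure so obtained is an average over the 1½ oscillations, which is why the true maximum must be higher still.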

In order to measure the velocity, a slit 0.5 mm. wide was placed in front of the
tube; this was photographed with a Dallmeyer cinema lens of f/1.9 aperture and
Ilford Astra VII plates which are specially sensitive to Hα light. The image on the
plate was 0.085 actual size. A revolving mirror (silver on glass) was placed in front
of the lens, it being large enough to fill the lens aperture with light. The rate of
rotation was 33 rev./sec. The displacement of the image by a given rotation was
measured directly on the ground glass of the camera, giving the mirror a small
rotation by means of a distance piece of known thickness. It was found that
1 cm. displacement of the image corresponded to 0.07 radian or 3.39 × 10⁻⁴ sec.
of time.
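The conversion from rotation to time may be sketched thus. It is assumed here that the 0.07 radian refers to the rotation of the mirror itself and not of the reflected beam; the close agreement with the printed 3.39 × 10⁻⁴ sec. supports that reading, the small residue being attributable to the rounding of 0.07.

```python
import math

MIRROR_REV_PER_SEC = 33.0
MIRROR_ROTATION_PER_CM_RAD = 0.07   # mirror rotation giving 1 cm. image displacement
MAGNIFICATION = 0.085               # image size on the plate relative to the object

# Angular velocity of the mirror in radians per second.
angular_velocity = 2.0 * math.pi * MIRROR_REV_PER_SEC

# Time represented by 1 cm. of horizontal displacement on the plate.
sec_per_cm_plate = MIRROR_ROTATION_PER_CM_RAD / angular_velocity

# Length along the tube represented by 1 cm. vertical on the plate.
cm_object_per_cm_plate = 1.0 / MAGNIFICATION

print(f"1 cm. horizontal = {sec_per_cm_plate:.3g} sec.")
print(f"1 cm. vertical   = {cm_object_per_cm_plate:.1f} cm. of tube")
```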

The discharges occurring at the rate of 100 per sec., a considerable number of 
them were photographed at each exposure. The direction of the slit was recorded by 
rotating the mirror very slowly through one revolution by hand. Such a slow rotation
serves to separate the discharges but does not give any indication of the
velocity of the gas impelled along the tube by a discharge, which velocity is, in 
these conditions, to be regarded as infinite in comparison. 

Having made a preliminary exposure with slow rotation, the motor is started and
the mirror run at a standard speed of 33 rev./sec. Another exposure is given,
superposed on the former one. On development, numerous images from both exposures
are seen, those at slow speed being almost uniformly spaced, and the others
interpolated at random among them (figure 7, enlarged, plate 1).

A specially clear photograph selected from another plate is reproduced in figure 8
(plate 1). It is easy to distinguish the exposures made with the mirror running at
full speed, which are inclined at a small but quite perceptible angle to those made
with the mirror going very slowly. This obliquity is the basis of measurement.
Measurement is made to the sharp left-hand edge of the photographed image. This
edge records the time at which the luminous gas first reaches a given height. The
obliquity was measured as 4.5° by means of a low-power microscope, making settings
on the lines by a spider thread in the rotating eyepiece. The result is regarded as
correct within 0.5°. As the luminosity progresses along the tube the mirror rotates,
and the image is displaced through an appreciable distance.




On the scale of the original photograph 1 cm. horizontal represents 3.39 × 10⁻⁴ sec.
of time; 1 cm. vertical represents 11.8 cm. of distance. The observed slope of 4.5°
is equivalent to 1 cm. vertical for 0.08 cm. horizontal.

This gives for the velocity with which the luminosity advances
11.8/(0.08 × 3.39 × 10⁻⁴) or 4.36 × 10⁵ cm./sec. (It may be mentioned for comparison
that the velocity 4.36 × 10⁵ cm./sec. above calculated would be the velocity of atomic
hydrogen in temperature equilibrium at 768° abs. or 495° C.)
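The reduction of the measured slope to a velocity can be sketched as below. Two points are assumptions of the sketch and not statements of the text: the tangent of 4.5° is used unrounded, which gives a velocity rather more than 1 per cent above the printed figure (the text rounds the tangent to 0.08); and the "velocity of atomic hydrogen in temperature equilibrium" is taken to be the root-mean-square speed. The stated uncertainty of 0.5° in the slope is carried through to show the range of velocity it implies.

```python
import math

CM_OBJECT_PER_CM_PLATE = 11.8   # 1 cm. vertical on the plate, as printed
SEC_PER_CM_PLATE = 3.39e-4      # 1 cm. horizontal on the plate, as printed
SLOPE_DEG = 4.5
SLOPE_ERROR_DEG = 0.5

def front_velocity(slope_deg):
    """Velocity of the luminous front (cm./sec.) for a given obliquity."""
    horizontal_cm = math.tan(math.radians(slope_deg))   # per 1 cm. vertical
    return CM_OBJECT_PER_CM_PLATE / (horizontal_cm * SEC_PER_CM_PLATE)

def rms_speed_atomic_hydrogen(kelvin):
    """Root-mean-square speed (cm./sec.); assumed meaning of the comparison."""
    gas_constant = 8.314e7   # erg per mol. per degree
    atomic_weight = 1.008
    return math.sqrt(3.0 * gas_constant * kelvin / atomic_weight)

velocity = front_velocity(SLOPE_DEG)
velocity_low = front_velocity(SLOPE_DEG + SLOPE_ERROR_DEG)
velocity_high = front_velocity(SLOPE_DEG - SLOPE_ERROR_DEG)
thermal = rms_speed_atomic_hydrogen(768.0)

print(f"velocity of front: {velocity:.3g} cm./sec.")
print(f"range for +/- 0.5 deg: {velocity_low:.3g} to {velocity_high:.3g} cm./sec.")
print(f"r.m.s. speed of H at 768 deg abs.: {thermal:.3g} cm./sec.")
```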

The luminosity in the vertical exit tube (figures 1, 2) was examined as follows: 

A plate was prepared with a line of small (2 mm.) holes at intervals of 1 cm. A
neutral glass reducing the light nearly in the ratio e: 1 was placed over the first of
these holes, and this was placed over the entrance of the vertical exit tube (figures
1, 2). The second hole, with no opaque glass, was brighter, but the difference was
less in each successive hole, and at the fifth hole (4 cm. from the first) the unscreened
luminosity was judged equal to the initial luminosity reduced by the opaque glass.

It appears then that the luminosity is reduced in the ratio e: 1 after a distance
of 4 cm.

At the velocity 4.36 × 10⁵ cm./sec. this represents a time interval of 0.92 × 10⁻⁵ sec.,
or, say, about 10⁻⁵ sec. It is not asserted that after another 10⁻⁵ sec. the intensity
will have again fallen in the same ratio. The indication is rather that the fall will
be more rapid.
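The final step, and the comparison with the received view, may be put numerically. The sketch takes the distance and velocity as printed; the last quantity, the distance the gas would travel in 10⁻⁸ sec., is added only to illustrate the size of the discrepancy and is not a figure given in the text.

```python
DISTANCE_CM = 4.0          # distance for a fall in the ratio e: 1
VELOCITY = 4.36e5          # cm./sec., as printed
RECEIVED_TIME_SEC = 1e-8   # order of magnitude usually assigned

# Time for the luminosity to fall in the ratio e: 1.
decay_time = DISTANCE_CM / VELOCITY

# Factor by which this exceeds the received value.
factor = decay_time / RECEIVED_TIME_SEC

# Distance the gas would cover in the received time (illustration only).
distance_if_received = VELOCITY * RECEIVED_TIME_SEC

print(f"observed time: {decay_time:.2g} sec.")
print(f"factor over the received value: about {factor:.0f}")
print(f"distance covered in 1e-8 sec.: {distance_if_received * 10:.3f} mm.")
```

On the received view the luminosity would thus fall in the ratio e: 1 within a few hundredths of a millimetre of the ring, instead of the 4 cm. observed.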

As will be seen by inspection of figure 2 on plate 1, the luminosity in the vertical 
side branch tapers out to a conical end. This seems to show that it is lost by diffusion 
of the luminous centres to the walls of the tube, when they become extinguished. 
This cause of extinction is superposed upon the decay of luminosity in the body of 
the gas. Owing to the complication introduced by this wall effect, it is scarcely 
feasible to study the exact law of decay by photometric measurements along the 
tube. 

The rate of decay is not exactly the same for the various lines of the Balmer series.
A greenish blue fringe is sometimes seen near the limit of the exuded luminosity, and
examination with a direct-vision prism shows that this is due to the tongue of
luminosity being a little longer in the light of Hβ than in that of Hα or Hγ. This is
somewhat strange, as one would expect regular progress along the series. The
difference of behaviour of the lines is not very marked, and indeed has not
always been noticeable. I have not determined what are the best conditions for
observing it.

It is clear from the above that for the Balmer lines in general at least 10⁻⁵ sec. is
required for the luminosity to sink to 1/e of its actual initial value, under the
conditions used. This is a thousand times greater than the period (10⁻⁸ sec.) usually
assigned from theoretical considerations, and from Wien's experiments on the
luminosity in the track of positive rays.

While no attempt is made to account for this, the evidence has been reproduced 
in a way which, it is hoped, will give the reader the opportunity to form his own 
opinion of it, and to weigh it against the theoretical objections. 




Experimental work which seems out of harmony with established fashions of 
thought does not easily find its way into books, and is apt to be soon forgotten. The 
writers of text-books and summaries of literature do not know how to deal with it, 
and find the easiest solution is not dealing with it at all. I hardly expect any other 
result in the present case, but it seems a duty to put the observed facts on record. 
It is believed that the photographic evidence which is here reproduced will put the 
reader in almost as good a position to judge of it as if he had carried it out himself. 


References

Condon & Shortley 1935 Theory of Atomic Spectra. Cambridge.
Lorentz, H. A. 1909 Theory of Electrons, p. 259.
Slack, F. G. 1926 Phys. Rev. 28, 1-12.
Strutt, R. J. (Rayleigh) 1914 a Proc. Roy. Soc. A, 90, 364-372.
Strutt, R. J. (Rayleigh) 1914 b Proc. Roy. Soc. A, 91, 92-103.
Strutt, R. J. (Rayleigh) 1917 Proc. Roy. Soc. A, 94, 88-99.
Taylor, H. S. & Glasstone, S. 1941 Physical Chemistry, 1, chap. II. London.
Wien, W. 1919 Ann. Phys., Lpz., 60, 597.
Wien, W. 1921 Ann. Phys., Lpz., 66, 229.


Explanation of Plate 

Figure 2. Ring discharge induced in hydrogen at 0.2 mm. pressure. Note the extrusion of
luminosity 6 cm. or so into the tube above.

Figure 4. Luminosity extruded into a T-tube instead of the straight vertical tube of figure 2.
Note how it divides into two streams.

Figure 5. Image of a slit placed over the vertical exit tube. Hydrogen discharge.

Figure 6. Image of a slit in a revolving mirror. Note the drawing out of the image. Length
of arrow denotes a duration of 1.8 × 10⁻⁵ sec. In nitrogen under the same conditions the image
is not perceptibly drawn out.

Figure 7. Jet of extruded luminosity. The longer vertical images, of which there are five at
equal intervals, are taken with the mirror revolving very slowly. The shorter images inclined
at 4.5° to the former are taken with the mirror revolving at speed. The slope measures the
speed with which the front of the extruded luminous jet moves forward as about 4 × 10⁵ cm./sec.

Figure 8. A similar picture from another plate giving about the same result.



Rayleigh, Proc. Roy. Soc., A, volume 183, plate 1 (figures 2, 4 to 8)
















The thermal decomposition of acetone

By J. R. E. Smith, D.Phil., and C. N. Hinshelwood, F.R.S.

(Received 2 December 1943)

Inhibition by propylene shows that free radical chains occur in the thermal decomposition
of acetone.

Quantitative investigations permit of conclusions about the mechanism of the chain
processes, which are compared and contrasted with those occurring in other decomposition
reactions.

A free radical chain mechanism for the homogeneous thermal decomposition of
acetone was postulated by Rice & Herzfeld (1934). This paper gives direct evidence
that part of the reaction does in fact proceed by such a mechanism. However, in
several respects the properties of the reaction, as hitherto known, differ from those
of unambiguous chain reactions. The reaction rate does not vary with the
surface-volume ratio (Hutchison & Hinshelwood 1926; Winkler & Hinshelwood 1935), nor
when free radicals are added by decomposing mercury dimethyl (Rice, Radowskas
& Lewis 1934). Again the analogous photolysis has a quantum yield of less than one.
Finally, Staveley & Hinshelwood (1937) found that the decomposition is not
sensitive to added nitric oxide, whose radical absorbing property had in other cases
proved a definite indication of chain mechanisms. This evidence led some to conclude
that the disruption of acetone liberates only molecules with saturated valencies. On the
other hand, Rice & Polly (1938) reported an inhibition of the reaction by propylene,
which would combine with free radicals; and more recently Rice found more, but
less definite, evidence of a free radical chain mechanism from the analysis of the
products at low pressures (private communication). Since useful data concerning
the decomposition of other compounds have been given by a detailed study of the
inhibitory effect of propylene (Smith & Hinshelwood 1942), a similar approach has
now been made to the problem of the mechanism of the acetone decomposition.

The decomposition of acetone occurs at a measurable rate at temperatures at 
which propylene itself reacts with gradual increase in pressure. Consequently the 
pressure increase of acetone in the presence of propylene bears no constant ratio 
to the amount of acetone decomposed, and the reaction was followed in all cases 
by analyses for acetone using the iodoform method (see Winkler & Hinshelwood 1935).

The rate of the reaction was measured by the reciprocal of the time for 50 %
decomposition, and was found to be considerably reduced by the addition of
propylene (table 1, figure 1). This provides unequivocal evidence of the existence of a
free radical chain mechanism. As with other decompositions the rate is reduced to
a well defined limit with no evidence of a secondary catalysis even with large amounts
of propylene.




Table 1. The rate of decomposition of acetone in
the presence of propylene at 570° C

partial
pressure of                     1/t½ × 10³
propylene        50 mm.     100 mm.    200 mm.    300 mm.
in mm.           acetone    acetone    acetone    acetone

0                1.64       1.90       2.08       1.98
2                1.20       -          -          -
5                0.96       1.35       1.60       -
10               0.81       1.14       1.42       1.67
20               0.70       0.96       1.32       1.46
50               0.60       0.72       1.05       1.21
100              0.53       0.69       0.91       1.03
150              -          -          0.80       -
170              -          0.58       -          0.83
200              -          -          0.76       -
estimated
limiting rate    0.50       0.55       0.60       0.60



Figure 1. Inhibition of the acetone decomposition by propylene at 570° C. Abscissa:
propylene (mm.). (Vertically divided circles, 50 mm. of acetone; black circles, 100 mm.;
open circles, 200 mm.; horizontally divided circles, 300 mm.)

This appears to be in conflict with the apparent lack of inhibition of the reaction 
by nitric oxide (Staveley & Hinshelwood 1937). Consequently the effect of nitric 
oxide was reinvestigated (table 2, figure 2). An unmistakable inhibition is revealed. 
This is counterbalanced at large pressures of nitric oxide by a catalysis (as observed 
with acetaldehyde, Smith & Hinshelwood 1942). In the simpler decompositions of 
diethyl ether and propaldehyde it was definitely shown that the same limiting rate 
was produced by the two inhibitors, nitric oxide and propylene (Smith & Hinshelwood 
1942). From figure 2 it is reasonable to estimate that with acetone, if it were not for 
the catalysis, the limiting rates for the two inhibitors would again coincide. Thus it 
is probable that both nitric oxide and propylene can largely inhibit the chain
reaction, leaving in both cases the same residual reaction of a non-chain nature.






Table 2. The rate of decomposition of 100 mm. of
acetone at 570° C in the presence of nitric oxide

partial pressure
of nitric oxide
in mm.               1/t½ × 10³

0.0                  1.90
0.5                  1.85
1.0                  1.77
2.0                  1.68
5                    1.56
10                   1.46
20                   1.40
36                   1.71



Figure 2. Inhibition of the acetone decomposition at 570° C by nitric
oxide (black circles), and by propylene (open circles). Abscissa: pressure of
inhibitor (mm.).

From table 1 and figure 1 it is apparent that the limiting value of the reaction rate
for large amounts of propylene is nearly the same for different initial pressures of
acetone. That is, the maximally inhibited reaction is of the first order. Further, the
normal uninhibited reaction is also of the first order above 100 mm. pressure, as
confirmed by table 1 (see also Winkler & Hinshelwood 1935). Consequently the
difference between the rates of these two reactions, which represents the inhibitable
portion of the reaction, is itself constant with varying initial pressure of acetone.
Thus the acetone chain reaction is of the first order.

The amount of propylene required to suppress a given fraction of the inhibitable
reaction increases as the initial pressure of acetone increases. This is shown by
figure 1, or by table 3 in which is tabulated (ρ − ρ∞)/(ρ₀ − ρ∞) against the partial
pressure of inhibitor for several initial pressures of acetone (ρ is the rate for a given
amount of inhibitor, ρ₀ is the rate without inhibitor, and ρ∞ is the limiting rate).
The 'inhibition curves' are not superimposable.
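The fraction of the inhibitable reaction remaining can be recomputed directly from the rates of table 1. The sketch below is illustrative only: the function and variable names are not the authors', and blank entries of table 1 are simply left out. The values it prints agree with those of table 3 to within 0.01, the residue being rounding.

```python
# Rates 1/t(1/2) x 10^3 from table 1, keyed by acetone pressure (mm.) and then
# by propylene pressure (mm.). Blank entries of the table are omitted.
TABLE_1 = {
    50:  {0: 1.64, 2: 1.20, 5: 0.96, 10: 0.81, 20: 0.70, 50: 0.60, 100: 0.53},
    100: {0: 1.90, 5: 1.35, 10: 1.14, 20: 0.96, 50: 0.72, 100: 0.69, 170: 0.58},
    200: {0: 2.08, 5: 1.60, 10: 1.42, 20: 1.32, 50: 1.05, 100: 0.91,
          150: 0.80, 200: 0.76},
    300: {0: 1.98, 10: 1.67, 20: 1.46, 50: 1.21, 100: 1.03, 170: 0.83},
}
# Estimated limiting rates from the last row of table 1.
LIMITING_RATE = {50: 0.50, 100: 0.55, 200: 0.60, 300: 0.60}

def remaining_fraction(rate, rate_uninhibited, rate_limit):
    """Fraction of the inhibitable reaction still proceeding."""
    return (rate - rate_limit) / (rate_uninhibited - rate_limit)

def inhibition_curve(acetone_mm):
    """Remaining fraction against propylene pressure for one acetone pressure."""
    rates = TABLE_1[acetone_mm]
    limit = LIMITING_RATE[acetone_mm]
    return {
        propylene_mm: round(remaining_fraction(rate, rates[0], limit), 2)
        for propylene_mm, rate in sorted(rates.items())
    }

for acetone_mm in TABLE_1:
    print(acetone_mm, "mm. acetone:", inhibition_curve(acetone_mm))
```

Reading across at a fixed propylene pressure, the remaining fraction rises with the acetone pressure, which is the sense in which the curves fail to superpose.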




A consideration of the steady state shows that the proposed chain reaction (given
below) is consistent with the first order of the reaction if the chains are broken
largely by step (4) rather than the recombination of two like radicals:

(1)   CH₃COCH₃ → 2CH₃ + CO,

(2)   CH₃ + CH₃COCH₃ → CH₄ + CH₂COCH₃,

(3)   CH₂COCH₃ → CH₃ + CH₂CO,

(4)   CH₃ + CH₂COCH₃ → C₂H₅COCH₃.

That step (4) is the chain-breaking step would indicate a concentration of CH₂COCH₃
radicals comparable with that of CH₃ radicals. However, if the propylene attacks
the CH₂COCH₃ radical, a steady state calculation shows that the 'inhibition curves'
would coincide for different initial pressures of acetone. That the 'inhibition curves'
are widely separated (see table 3) leads to the assumption that the propylene attacks
the CH₃ radical. Apparently propylene combines more rapidly with the CH₃ radical
than with more bulky radicals in comparable concentrations. This conclusion was
also reached from a similar argument in the case of the decomposition of diethyl
ether (Smith & Hinshelwood 1942).
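The steady-state argument for the uninhibited scheme can be illustrated numerically. Writing a for the acetone concentration, m for CH₃ and r for CH₂COCH₃, the two steady-state conditions for steps (1) to (4) combine to k₂m² − k₁m − k₁k₃/k₄ = 0, in which a does not appear, so that the rate a(k₁ + k₂m) is of the first order. The sketch below is a minimal illustration under that algebra; the velocity constants are hypothetical values chosen only for the demonstration, and the alternative ending of chains by CH₃ + CH₃ (constant k₅) is supplied here for contrast and is not a step written out in the paper.

```python
import math

def rate_step4_termination(a, k1, k2, k3, k4):
    """Steady-state loss of acetone when chains end by step (4)."""
    # Steady state in CH3 and CH2COCH3 reduces to
    # k2*m**2 - k1*m - k1*k3/k4 = 0, independent of the acetone concentration a.
    m = (k1 + math.sqrt(k1 * k1 + 4.0 * k1 * k2 * k3 / k4)) / (2.0 * k2)
    return a * (k1 + k2 * m)

def rate_like_radical_termination(a, k1, k2, k5):
    """Steady-state loss of acetone when chains end by CH3 + CH3 instead."""
    # Here 2*k1*a = 2*k5*m**2, so m varies as the square root of a.
    m = math.sqrt(k1 * a / k5)
    return a * (k1 + k2 * m)

def apparent_order(rate, a_low, a_high):
    """Slope of log(rate) against log(concentration) between two values."""
    return math.log(rate(a_high) / rate(a_low)) / math.log(a_high / a_low)

# Hypothetical velocity constants in arbitrary units (illustration only).
K1, K2, K3, K4, K5 = 1e-4, 1.0, 10.0, 100.0, 100.0

order_step4 = apparent_order(
    lambda a: rate_step4_termination(a, K1, K2, K3, K4), 50.0, 300.0)
order_like = apparent_order(
    lambda a: rate_like_radical_termination(a, K1, K2, K5), 50.0, 300.0)

print(f"order with step (4) as chain-ending step: {order_step4:.3f}")
print(f"order with CH3 + CH3 as chain-ending step: {order_like:.3f}")
```

With step (4) ending the chains the order is unity whatever constants are chosen; with recombination of like radicals it approaches three halves when the chains are long.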


Table 3. 'Inhibition-curve' data for acetone
and propylene at 570° C

pressure                    (ρ − ρ∞)/(ρ₀ − ρ∞)
of propylene     50 mm.     100 mm.    200 mm.    300 mm.
in mm.           acetone    acetone    acetone    acetone

0                1.00       1.00       1.00       1.00
2                0.61       -          -          -
5                0.40       0.60       0.68       -
10               0.27       0.44       0.55       0.78
20               0.18       0.30       0.48       0.62
50               0.09       0.13       0.30       0.44
100              0.03       0.10       0.21       0.31


In several respects the acetone reaction differs from other chain reactions. The 
reaction rate is not sensitive to change in the surface-volume ratio, nor to the free 
radicals from decomposing mercury dimethyl. Radiation cannot induce a quantum 
yield greater than one. Higher temperatures are required to produce comparable 
rates of reaction. Finally, very much more nitric oxide is required to produce 
appreciable inhibition than, for example, with the decomposition of diethyl ether. 
These facts would be explained if the mean chain length in the acetone chain 
decomposition were less than with most of the other decompositions studied. Then 
in acetone relatively more chain-breaking processes would occur while a given number 
of acetone molecules decomposed. Consequently alternative, competing chain-breaking
processes due to inhibitor or surface would be less important, as would also
be the addition of free radicals from an external source at a fixed rate. Moreover, 
higher temperatures would be necessary to supply a concentration of free radicals 
to maintain the reaction rate. 




The activation energy of the reaction in the presence of large amounts of
propylene is 62,600 cal. per mol. (table 4). Although this is 6000 cal. less than the
activation energy of the normal uninhibited reaction (Winkler & Hinshelwood
1935), the reaction rate is still much greater than would be produced by a simple
collisional activation in two squared terms. (For example, at 100 mm. pressure and
600° C the rate of the maximally inhibited reaction is 2.1 × 10¹⁵ mol. cm.⁻³ sec.⁻¹,
whereas Ze^(−E/RT) is only 7.8 × 10¹¹.) Indeed, this is to be expected since the
maximally inhibited reaction is of the first order. Thus the results for the maximally
inhibited reaction are still consistent with the hypothesis of an internal rearrangement
process in which many degrees of freedom contribute to the activation.
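The quoted rate can be retraced on the assumption, not stated in the text, that it is the initial first-order rate obtained from the time of half-change at 600° C in table 4 (372 sec.) and the number of acetone molecules per c.c. at 100 mm. and 600° C. The Boltzmann factor for E = 62,600 cal. is shown alongside; the physical constants are modern handbook values.

```python
import math

BOLTZMANN_J_PER_K = 1.380649e-23
PA_PER_MM_HG = 133.322
GAS_CONSTANT_CAL = 1.987          # cal. per mol. per degree

def molecules_per_cc(pressure_mm, temp_c):
    """Number of molecules per c.c. of a perfect gas."""
    kelvin = temp_c + 273.15
    per_cubic_metre = pressure_mm * PA_PER_MM_HG / (BOLTZMANN_J_PER_K * kelvin)
    return per_cubic_metre * 1e-6

def first_order_constant(half_time_sec):
    """Velocity constant of a first-order change from its time of half-change."""
    return math.log(2.0) / half_time_sec

concentration = molecules_per_cc(100.0, 600.0)
initial_rate = first_order_constant(372.0) * concentration   # assumed derivation
boltzmann_factor = math.exp(-62600.0 / (GAS_CONSTANT_CAL * (600.0 + 273.15)))

print(f"molecules per c.c.: {concentration:.3g}")
print(f"initial rate: {initial_rate:.2g} mol. per c.c. per sec.")
print(f"exp(-E/RT): {boltzmann_factor:.2g}")
```

On this reading the rate comes out close to the printed 2.1 × 10¹⁵, some thousands of times the collisional figure of 7.8 × 10¹¹.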

Table 4. The temperature coefficient of the rate of decomposition
of 100 mm. of acetone in the presence of 100 mm. of propylene

temp. °C         t½ (sec.)

615              193
600              372
585              686
570              1306
555              2638
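An activation energy may be recovered from table 4 by fitting the logarithm of the time of half-change against the reciprocal of the absolute temperature. The sketch below uses an ordinary least-squares line, which is an assumption: the authors do not say how their value was drawn from the data. Fitted in this way the slope gives a value within a few per cent of the 62,600 cal. per mol. quoted.

```python
import math

GAS_CONSTANT_CAL = 1.987   # cal. per mol. per degree

# (temperature in deg C, time of half-change in sec.) from table 4.
TABLE_4 = [(615, 193), (600, 372), (585, 686), (570, 1306), (555, 2638)]

def activation_energy_cal(points):
    """Least-squares slope of ln(t_half) against 1/T, multiplied by R."""
    xs = [1.0 / (temp_c + 273.15) for temp_c, _ in points]
    ys = [math.log(t_half) for _, t_half in points]
    count = len(points)
    x_mean = sum(xs) / count
    y_mean = sum(ys) / count
    s_xy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    s_xx = sum((x - x_mean) ** 2 for x in xs)
    # ln(t_half) = const + E/(R T), so the slope is E/R.
    return GAS_CONSTANT_CAL * s_xy / s_xx

energy = activation_energy_cal(TABLE_4)
print(f"activation energy from table 4: {energy:.0f} cal. per mol.")
```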


Table 5. Relative efficiencies as inhibitors of propylene
and nitric oxide in different decompositions

                 ratio of the amounts
                 of C₃H₆ and NO to          probable
                 produce a given            radicals
reactant         inhibition                 absorbed

acetone          approx. 1                  CH₃
acetaldehyde     approx. 5                  CH₃
diethyl ether    14                         CH₃, CH₂OC₂H₅
propaldehyde     100                        C₂H₅


A comparison of the results for acetone with those for other decompositions shows 
that the relative efficiencies as inhibitors of nitric oxide and propylene depend very 
greatly on the particular decomposition (table 5). It was concluded from kinetic results 
with the decomposition of diethyl ether that propylene, in contrast with nitric oxide, 
combines more readily with the smaller CH 3 radical than with the larger CH 2 OC 2 H 5 
radical. Again from table 5 the advantage of nitric oxide over propylene seems to 
be much smaller in combining with the CH 3 radical than with the somewhat larger 
C 2 H 5 radical. It is perhaps surprising that the effect is as large as observed, but this 
seems the only explanation. 

References

Hobbs 1938 Proc. Roy. Soc. A, 167, 456.
Hutchison & Hinshelwood 1926 Proc. Roy. Soc. A, 111, 245.
Rice & Herzfeld 1934 J. Amer. Chem. Soc. 56, 284.
Rice & Polly 1938 J. Chem. Phys. 6, 273.
Rice, Radowskas & Lewis 1934 J. Amer. Chem. Soc. 56, 2497.
Smith & Hinshelwood 1942 Proc. Roy. Soc. A, 180, 237.
Staveley & Hinshelwood 1937 J. Chem. Soc. p. 1568.
Winkler & Hinshelwood 1935 Proc. Roy. Soc. A, 149, 340.



On the flow of gas through a nozzle with velocities 
exceeding the speed of sound 

By L. Fox, D.Phil. and R. V. Southwell, F.R.S.

(Received 16 December 1943)

An iterative method outlined by Green & Southwell (1944), but not applied in their paper
for the reason that it makes no use of 'relaxation methods', is here applied to determine the
supersonic regime for gas flowing irrotationally through a convergent-divergent nozzle, i.e.
that (unique) regime in which the pressure and density of the gas decrease continuously in its
passage from end to end.

Osborne Reynolds's approximate treatment of the problem (1886) assumed the velocity to
be distributed uniformly over each cross-section, and in consequence found a unique value
for the limiting mass-flow, whether the velocity be subsonic or supersonic downstream of the
'throat' (upstream it is always subsonic). Here, a more exact treatment shows that the
supersonic value is very slightly (0.045 %) greater than the subsonic value, which Reynolds's
theory overestimates by 0.083 %. The two regimes of course imply two different values of the
pressure at exit, and for intermediate pressures (on the assumptions of this paper) there is no
solution of the problem.

Even in the subsonic regime (when the mass flow is critical) velocities exceeding the local
speeds of sound are attained in regions adjoining the nozzle walls near the throat.


Introduction 

1. Osborne Reynolds (1886) showed that the discharge of gas through a conver¬ 
gent-divergent nozzle cannot exceed a certain limiting value, dependent on the 
starting conditions, however far the back-pressure is reduced. His treatment 
disregarded the effects of curvature of the stream-lines, assuming the velocity to 
be uniform in any section. On that understanding the limiting mass flow is attained 
when in the smallest section (or ‘throat’) the gas has a velocity equal to the local 
speed of sound. 

In these circumstances (as was shown by Reynolds) either of two regimes can occur
in sections downstream of the throat: either (a) after attaining its maximum expansion
at the throat the gas may again contract (with increase of pressure) as it passes
to the exit section, or (b) it may continue to expand, with a further decrease of
pressure. No regime is possible of a type intermediate between (a) and (b), and
accordingly it may be conjectured (by analogy with the theory of equilibrium of
mechanical systems) that one or other of these two regimes is unstable.

2. When an attempt is made to improve the theory by a treatment which makes 
allowance for curvature of stream-lines, even in the case of laminar (two-dimensional)
flow the problem is very difficult. Taylor & Sharman (1928) avoided its
purely mathematical difficulties by having recourse to an electrical analogue—the 
flow of current in a conducting layer of varying thickness. By this means they could 
determine flow patterns so long as these entailed no velocity as great as the local 
speed of sound; but at that limit the method failed and its results became divergent 
—thus giving further indication of some form of instability. 


Similar experience was reported in a recent paper (Green & Southwell 1944)
which attacked the same (two-dimensional) problem by ‘relaxation methods’.
A technique devised for other problems (‘liquidation’ of residual errors by a use of
‘relaxation patterns’ appropriate to a regular net) proved successful as applied to
flows involving no velocity above the local speed of sound; but it too broke down
as the mass flow approached its limiting value (§ 1), in a manner again suggestive
of some latent instability.

3. Now a regime that is unstable may yet be discoverable by calculation, as is 
shown (e.g.) in the theory of compression members (struts) and of ‘free’ stream-lines;
but it is not to be expected that it will be revealed by tentative methods such
as are described in § 2, because disturbances are implied in any inexact solution. For
this reason Green & Southwell, in §§ 11-12 and 22-24 of their paper, outlined an
alternative (non-relaxational) method, of iterative type, which should serve to 
determine the supersonic as well as subsonic regimes. They did not proceed to 
results, as inappropriate to a paper concerned with relaxation methods. 

This paper uses their alternative method to compute the supersonic flow pattern
for a closely similar† shape of nozzle, also a limiting subsonic pattern in extension
of their results. The work was done in part before, but for the most part after,
Dr Green’s departure for Australia. With one or two exceptions (which are explained)
it confirms his results, and it completes his partial (i.e. subsonic) solution;
but it leads to two additional and important conclusions, viz.

(1) that two limiting values of the mass flow in fact replace the single value given
by Reynolds’s less exact analysis: one is the upper limit under conditions which are
subsonic both upstream and downstream of the ‘throat’; the other is that value
for which supersonic velocities occur at all downstream sections. The second value
is about 0.045 % higher than the first, which Reynolds’s theory overestimates by
some 0.083 %.

(2) that at the lower value, while the velocity is subsonic both upstream and 
downstream, it is supersonic in two localized regions adjoining the walls at the 
throat. 

The paper is confined strictly to computational aspects of the problem, being 
intended simply as a supplement to Green & Southwell (1944). Discussion of the 
physical bearing of the work is reserved for a subsequent paper, and on that account 
no reference is made to previous work other than that of Taylor & Sharman (1928).

I. Outline of the iterative (‘strip integration’) method 

4. In the earlier paper (Green & Southwell 1944) the gas was assumed to start
from rest in a reservoir where its pressure is $p_0$ and its density $\rho_0$, and to move
without rotation through a nozzle having the shape defined in figure 1 and a size
defined by its throat width $2T$; the position of any point in the field of flow was
defined by ‘non-dimensional’ (i.e. purely numerical) co-ordinates $x$ and $y$, these
standing for its true (Cartesian) co-ordinates expressed as multiples of $T$; and the
total mass flow through the nozzle was denoted by $2M$. By conformal transformation
of the field of flow into a rectangle (§§ 6-8*) a new system of numerical co-ordinates
$\alpha$, $\beta$ was devised such that $\beta = 0$ on the centre-line, $\beta = \text{const.} = \beta_1$ (say) at the
nozzle wall on the side of $y$ positive (figures 2).

† The reason for the difference in shape is explained in § 11.



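On this understanding, $\psi$ standing for the ‘mass-flow function’ (so that $\psi$ is
purely numerical), $\psi$ has to satisfy the boundary conditions

$$\psi = 0 \ \text{when}\ \beta = 0, \qquad \psi = 1 \ \text{when}\ \beta = \beta_1;$$   (1)

and its governing equation was shown to be

$$\nabla^2(\chi\psi) - \psi\,\nabla^2\chi = 0,$$   (19)*   (2)

where

$$\chi^2 = F\left\{\mu^2 h^2\left[\left(\frac{\partial\psi}{\partial\alpha}\right)^2 + \left(\frac{\partial\psi}{\partial\beta}\right)^2\right]\right\}$$   (18)*   (3)

and the form of the function $F$ is known. In (2)

$$\nabla^2 \equiv \frac{\partial^2}{\partial\alpha^2} + \frac{\partial^2}{\partial\beta^2},$$   (21)*   (4)

and $\chi^{-2} = \rho$ denotes the local density expressed as a multiple of $\rho_0$;   (13)*   (5)

and in (3)

$$h^2 = \left|\frac{d(\alpha + i\beta)}{d(x + iy)}\right|^2 = \left(\frac{\partial\alpha}{\partial x}\right)^2 + \left(\frac{\partial\alpha}{\partial y}\right)^2 = \left(\frac{\partial\beta}{\partial x}\right)^2 + \left(\frac{\partial\beta}{\partial y}\right)^2,$$   (6)

i.e. $h$ is the modulus of the conformal transformation. The quantity

$$\mu = M/\rho_0 a_0 T,$$   (14)*

where $a_0$ stands for the velocity of sound in the gas at starting, so that

$$a_0^2 = \gamma p_0/\rho_0.$$   (7)

[Figure 2b]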

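With this notation (differing from that of the earlier paper in that the dashes
there used to distinguish ‘non-dimensional’ quantities are here suppressed) the
component velocities at any point in the fluid are $u$ and $v$ where

$$\rho u = \mu\,\frac{\partial\psi}{\partial y}, \qquad \rho v = -\mu\,\frac{\partial\psi}{\partial x};$$   (2)*, (13)*   (8)

therefore

$$\rho^2 q^2 = \mu^2\left[\left(\frac{\partial\psi}{\partial x}\right)^2 + \left(\frac{\partial\psi}{\partial y}\right)^2\right],$$

i.e., by (5) and (6),

$$\rho^2 q^2 = q^2/\chi^4 = \mu^2 h^2\left[\left(\frac{\partial\psi}{\partial\alpha}\right)^2 + \left(\frac{\partial\psi}{\partial\beta}\right)^2\right],$$   (9)

when $q$ denotes the local velocity of the gas expressed as a multiple of $a_0$. Equation
(3) can now be written in the form

$$\chi^2 = 1/\rho = F(\rho^2 q^2),$$

the form of $F$ being calculable from the properties of the gas. Figure 3 shows $\rho q$
and $q$ ($= \chi^2\rho q$) plotted against $\chi$, for air assumed (§ 10*) to start from rest at
temperature 15° C and pressure 100 lb./sq. in. (absolute). It was plotted from the
relation

$$\rho^2 q^2 = 5\chi^{-4}\left(1 - \chi^{-0.8}\right),$$   (23)*   (10)

which in § 10* was derived, for these starting conditions, on the assumption that
air can be treated as a perfect gas ($\gamma = 1.4$), and which on that understanding may
be used in place of (3).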

5. The reason for the introduction of conformal transformation was that by using
$\alpha$ and $\beta$ as independent variables, in place of $x$ and $y$, the boundary is transformed
into a rectangle, consequently relaxation can be effected on a square net (figure 2b)
having all its meshes complete, i.e. with no ‘irregular stars’. The cost of this
simplification is the labour entailed in the determination of $h$, of which the values at nodal
points must be known with accuracy in order that $\chi$-values may be accurately
determinable from (9) and (10). Three-figure accuracy was claimed (§ 15*) for
Green’s computed values, and in his figure 7c* $h$ was recorded to five figures, which
it was claimed (cf. footnote to § 15*) satisfy to this accuracy the requirement that
$\log h$ must be plane-harmonic. We shall show (§ 11) that his claims are not entirely
justified; but it should be emphasized that the fact had no importance in his subsonic
investigations, for which three-figure accuracy in $h$ was sufficient.




6. This paper does not make use of relaxation methods, but of the iterative (‘strip
integration’) method which was devised (cf. § 19*) to meet the failure of those
methods as applied to the supersonic regime. We now summarize §§ 11-12* and
22-24*.

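Multiplied through by $\chi$, equation (2) can be put into the form

$$\frac{\partial}{\partial\beta}\left(\chi^2\frac{\partial\psi}{\partial\beta}\right) = -\frac{\partial}{\partial\alpha}\left(\chi^2\frac{\partial\psi}{\partial\alpha}\right).$$   (11)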
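The step from (2) to (11) can be checked directly. The short derivation below is an editorial verification, not taken from the earlier paper; it assumes only the product rule and the definition of $\nabla^2$ in (4), with $\nabla$ standing for the gradient in the $\alpha$-$\beta$ plane.

```latex
\begin{align*}
\chi\left[\nabla^2(\chi\psi) - \psi\,\nabla^2\chi\right]
  &= \chi\left[\chi\,\nabla^2\psi + 2\,\nabla\chi\cdot\nabla\psi\right] \\
  &= \chi^2\,\nabla^2\psi + \nabla(\chi^2)\cdot\nabla\psi \\
  &= \frac{\partial}{\partial\alpha}\!\left(\chi^2\frac{\partial\psi}{\partial\alpha}\right)
   + \frac{\partial}{\partial\beta}\!\left(\chi^2\frac{\partial\psi}{\partial\beta}\right).
\end{align*}
```

Setting the last line to zero and moving the $\alpha$-term to the right gives the form in which the $\beta$-differentials stand alone on the left, which is the form used in the strip integration.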



Figure 3


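Now we may expect that contours of $\psi$, in the $x$-$y$ plane, will approximate closely
to contours of $\beta$; and this means that the $\beta$-differentials of $\psi$ will predominate in
(11), so that without serious error we may suppress the $\alpha$-differentials to obtain

$$\frac{\partial}{\partial\beta}\left(\chi^2\frac{\partial\psi}{\partial\beta}\right) = 0, \quad\text{i.e.}\quad \chi^2\frac{\partial\psi}{\partial\beta} = \text{const.} = A \ \text{(say)}.$$   (24)*   (12)

Equally $\partial\psi/\partial\beta$ will predominate over $\partial\psi/\partial\alpha$ in the expression (9), so that without
serious error we may suppress the $\alpha$-differential to obtain

$$\rho q = q/\chi^2 = \mu h\,\frac{\partial\psi}{\partial\beta}.$$   (25)*   (13)

(Here, in fact, the error is of second order when $(\partial\psi/\partial\alpha)/(\partial\psi/\partial\beta)$ is small.) Now
eliminating $\partial\psi/\partial\beta$ between (12) and (13), we have approximately

$$q = \rho q\chi^2 = A\mu h,$$   (26)*   (14)

where, by the second of (12),

$$\psi = \int_0^{\beta}\frac{A}{\chi^2}\,d\beta' = A\int_0^{\beta}\chi^{-2}\,d\beta'.$$   (27)*   (15)

Applied to (15), the boundary condition (1) gives

$$1 = A\int_0^{\beta_1}\chi^{-2}\,d\beta, \quad\text{i.e.}\quad \mu = A\mu\int_0^{\beta_1}\chi^{-2}\,d\beta.$$   (16)

For our assumed starting conditions, in virtue of (10), (14) may be replaced by

$$A\mu h = q = \rho q\chi^2 = \sqrt{5\left(1 - \chi^{-0.8}\right)},$$

giving

$$\chi^{-2} = \left(1 - \tfrac{1}{5}A^2\mu^2h^2\right)^{5/2},$$

and hence, according to (16),

$$1 = A\int_0^{\beta_1}\left(1 - \tfrac{1}{5}A^2\mu^2h^2\right)^{5/2}d\beta.$$   (17)

It should be remarked that the range of $q$ is always limited. Thus $0 < q < \sqrt{5}$
according to (17).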

7. For every $\alpha$-line of the rectangular field (§ 5) we have (when the conformal
transformation has been effected) a known distribution of $h$. Therefore, for any
assumed value $A\mu$ such that $A\mu h$ does not fall outside the range of $q$, we can deduce
the distribution of $q$ and thence, using (17), of $\chi$. Then, evaluating the definite
integral in the second of (16) and inserting the assumed value of $A\mu$, we have a
corresponding value for $\mu$, so can construct a curve of the type of figure 4. $\mu = 0$ at
both ends of the range of $A\mu$, and it has a maximum value ($\mu_{\max}$ say) at some
intermediate point.
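The construction just described can be sketched in a few lines. This is an illustrative sketch only, not the authors' working: it assumes the perfect-gas starting conditions ($\gamma = 1.4$), so that the density is $\chi^{-2} = (1 - q^2/5)^{5/2}$ with $q = A\mu h$, it replaces the definite integral by the trapezoidal rule, and the function names and the sample distribution of $h$ are hypothetical.

```python
import math

def chi_inv_sq(q):
    """Density chi**-2 implied by q**2 = 5*(1 - chi**-0.8)."""
    return (1.0 - q * q / 5.0) ** 2.5

def mu_of_A_mu(A_mu, h_values, beta_1):
    """mu = (A*mu) * integral of chi**-2 over 0..beta_1 (trapezoidal rule).

    h_values holds h at equally spaced beta from 0 to beta_1 on one section.
    """
    d_beta = beta_1 / (len(h_values) - 1)
    rho = []
    for h in h_values:
        q = A_mu * h
        if q >= math.sqrt(5.0):
            raise ValueError("A*mu*h falls outside the range of q")
        rho.append(chi_inv_sq(q))
    integral = d_beta * (0.5 * rho[0] + sum(rho[1:-1]) + 0.5 * rho[-1])
    return A_mu * integral

def mu_max(h_values, beta_1, samples=20001):
    """Scan A*mu over its admissible range; return (A_mu, mu) at the maximum."""
    upper = math.sqrt(5.0) / max(h_values)
    best = (0.0, 0.0)
    for k in range(1, samples):
        A_mu = upper * k / samples
        mu = mu_of_A_mu(A_mu, h_values, beta_1)
        if mu > best[1]:
            best = (A_mu, mu)
    return best

# Hypothetical section on which h rises slightly from centre-line to wall.
h_hypothetical = [1.0 + 0.02 * (k / 10.0) ** 2 for k in range(11)]
A_mu_at_max, mu_maximum = mu_max(h_hypothetical, beta_1=1.0)
```

On a section where $h$ is constant and equal to $1/\beta_1$ the scan returns Reynolds's value $(5/6)^3$, which gives a convenient check on the arithmetic.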


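8. All of this has been based on the assumption (§ 6) that the exact equation (11)
may be replaced by the first of (12). For closer approximation, let the solution
found as above ($\psi_1$, $\chi_1$, say) be substituted on the right of (11), and let a second
approximation (say, $\psi_2$, $\chi_2$) be found from the resulting equation

$$\frac{\partial}{\partial\beta}\left(\chi_2^2\frac{\partial\psi_2}{\partial\beta}\right) = -\frac{\partial}{\partial\alpha}\left(\chi_1^2\frac{\partial\psi_1}{\partial\alpha}\right).$$   (37)*   (18)

Thereby we obtain, in place of the second of (12),

$$\chi^2\frac{\partial\psi}{\partial\beta} = A - F_1(\beta),$$   (38)*   (19)

$A$ as before denoting a constant of integration, and where

$$F_1(\beta) = \int_0^{\beta}\frac{\partial}{\partial\alpha}\left(\chi_1^2\frac{\partial\psi_1}{\partial\alpha}\right)d\beta'$$   (41)*   (20)

is calculable. The expression (13) holds as before, and on eliminating $\partial\psi/\partial\beta$ between it
and (19) we have, in place of (14),

$$q = \rho q\chi^2 = A\mu h - \mu hF_1(\beta).$$   (39)*   (21)

[Figure 4]

Also from (19) we have, in place of (15),

$$\psi = A\int_0^{\beta}\chi^{-2}\,d\beta' - \int_0^{\beta}\chi^{-2}F_1(\beta')\,d\beta',$$

and hence, in place of the second of (16),

$$\mu = \mu A\int_0^{\beta_1}\chi^{-2}\,d\beta - \mu\int_0^{\beta_1}\chi^{-2}F_1(\beta)\,d\beta.$$   (40)*   (22)

The procedure of § 7 is now no longer applicable, because $q$ is not determinable
from (21) unless $\mu h$ as well as $A\mu h$ is known. But our first approximation was derived
on an assumption which makes $F_1(\beta)$ small, and on that account the new value of
$\mu$ which corresponds with any assumed value of $A$ will differ but little from $\mu_1$, its
value according to the first approximation: hence we may with sufficient accuracy
replace (21) and the second of (22) by

$$q = \rho q\chi^2 = A\mu h - \mu_1hF_1(\beta)$$

and by

$$\mu = A\mu\int_0^{\beta_1}\chi^{-2}\,d\beta - \mu_1\int_0^{\beta_1}\chi^{-2}F_1(\beta)\,d\beta,$$   (23)

and then ($\mu_1$ and $F_1(\beta)$ being known) the construction of figure 4 can proceed as
before. For our assumed starting conditions, i.e. when (10) is satisfied, the first of
(22) becomes

$$\psi = A\int_0^{\beta}\left(1 - \tfrac{1}{5}q^2\right)^{5/2}d\beta' - \int_0^{\beta}\left(1 - \tfrac{1}{5}q^2\right)^{5/2}F_1(\beta')\,d\beta',$$

and the second of (22) becomes

$$\mu = A\mu\int_0^{\beta_1}\left(1 - \tfrac{1}{5}q^2\right)^{5/2}d\beta - \mu_1\int_0^{\beta_1}\left(1 - \tfrac{1}{5}q^2\right)^{5/2}F_1(\beta)\,d\beta,$$   (24)

$q$ being given by the first of (23).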

Application of the iterative method to determine the supersonic regime 

9. When the supersonic regime is in question, $\mu$ must be given its limiting value
$\mu_{\rm crit}$, for which Osborne Reynolds’s treatment yields the approximation

$$(5/6)^3 = 0.578704$$   (§ 10*).
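The figure $(5/6)^3$ can be recovered numerically as the greatest value of $\rho q$ for a perfect gas with $\gamma = 1.4$ expanding from rest. The sketch below is an editorial check, not part of the original computation; it assumes the isentropic relation $q^2 = 5(1 - \rho^{0.4})$, with $\rho$ in units of $\rho_0$ and $q$ in units of $a_0$, and the function names are hypothetical.

```python
import math

GAMMA = 1.4  # perfect-gas value assumed for air

def mass_flux(rho, gamma=GAMMA):
    """rho*q for isentropic expansion from rest (rho in rho_0, q in a_0)."""
    q_squared = 2.0 / (gamma - 1.0) * (1.0 - rho ** (gamma - 1.0))
    return rho * math.sqrt(max(q_squared, 0.0))

def maximise(f, lo, hi, iterations=200):
    """Golden-section search for the maximum of a unimodal function."""
    ratio = (math.sqrt(5.0) - 1.0) / 2.0
    a, b = lo, hi
    for _ in range(iterations):
        c = b - ratio * (b - a)
        d = a + ratio * (b - a)
        if f(c) > f(d):
            b = d
        else:
            a = c
    return 0.5 * (a + b)

rho_star = maximise(mass_flux, 0.0, 1.0)        # density at the maximum
mu_reynolds = mass_flux(rho_star)               # limiting value of rho*q
q_star = mu_reynolds / rho_star                 # velocity at the maximum
sound_speed = rho_star ** ((GAMMA - 1.0) / 2.0) # local speed of sound / a_0
```

The maximum falls where the velocity equals the local speed of sound, in agreement with the statement of § 1.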

For reasons stated in § 24* it is of great importance that this value (which is not
known in advance) be estimated closely, and clearly, for that purpose, similar
accuracy is needed in the evaluation of $h$. Whereas three-figure accuracy sufficed
(§ 5) for Green’s investigation of speeds below the speed of sound, here we shall need
the five-figure accuracy which was claimed for the values recorded in his figure 7c*.

The method used to obtain those values was indicated in a footnote to § 15*.
Having effected the conformal transformation (§ 5) and deduced three-figure values
of $h$ by means of (6), Green accepted these values as having five-figure accuracy
on the boundary ($\beta = \beta_1$), and deduced for every internal point a corresponding value
of $\log h$, which is plane-harmonic. The underlying idea was that these values,
although they might not correspond (to five-figure accuracy) with the form of
nozzle shown in figure 1, would nevertheless imply a form so near as to be identical
within the limits of accuracy that are attainable in manufacture; and on this account
no attempt was made to determine the exact form that they implied.

This was sound procedure as concerned the computation of subsonic flow patterns,
but something more is needed when (as in this paper) we wish to compare with
Reynolds’s estimate the value given by our iterative method for the critical mass flow.
If in fact our analysis implies a throat width not of $2T$ but of some other value $2T'$,
then the estimate of $\mu_{\rm crit}$ according to Reynolds’s theory is not $(5/6)^3$, as stated
above, but (cf. § 10*)

$$\mu_{\rm crit} = M_{\rm crit}/(\rho_0 a_0 T) = (5/6)^3\,T'/T.$$   (25)

So, for an accurate comparison, $T'/T$ must be accurately computed.

10. This is a simpler matter than a derivation of $y$-values everywhere (§ 12).
For according to (6), if $\partial\beta/\partial x = 0$ as is known (from Green’s figure 7b*) to be true near
the throat, then to a close approximation

$$h = \frac{\partial\beta}{\partial y},$$

and so

$$\frac{T'}{T} = \int_0^{\beta_1} h^{-1}\,d\beta.$$   (26)

The distribution of h 

11. Here again our accuracy will depend on the accuracy of the accepted $h$-values.
Therefore we are obliged to notice one or two slight errors in Green’s figure 7c*,
which would give trouble in a computation of the supersonic regime, although for
his purpose they are unimportant.

The boundary values which Green assumed for $h$ (§ 9) do not (to five-figure
accuracy) vary smoothly in every part of the nozzle wall: by what must have been
a slip in computation, a value 100,800 was there given to $h$ at a point where 100,000
is indicated by a plotting of $h$ against $\alpha$, and thus a local ‘disturbance’ was
introduced. This, of course, entailed consequential errors in neighbouring $h$-values as
determined by the requirement that $\log h$ must be plane-harmonic.

Having reason (cf. § 2) to believe that the supersonic regime is in fact unstable,
we sought to avoid all features in the assumed shape of nozzle which might militate
against the finding of this regime by computation, and accordingly we decided not
to retain Green’s shape exactly, but to modify his assumed $h$-values by a preliminary
smoothing process before proceeding to deduce internal values from the consideration
that $\log h$ is plane-harmonic. Figures 5 record the resulting $h$-values, which
compare with those of his figure 7c*. To the extent that they are different, they
obviously relate to a different form of nozzle; and the problem is thus presented, of
determining this new form.

The nozzle shape

12. The conformal transformation determines $\alpha + i\beta$ as a function of $x + iy$, and
it therefore follows (in the theory of functions of a complex variable) that the
differential coefficient $d(\alpha + i\beta)/d(x + iy)$ is a function of either. Moreover,

$$\frac{d(\alpha + i\beta)}{d(x + iy)} = \frac{\partial\alpha}{\partial x} + i\frac{\partial\beta}{\partial x} = \frac{\partial\beta}{\partial y} + i\frac{\partial\beta}{\partial x}$$   (i)

$$= h(\cos\theta - i\sin\theta) = he^{-i\theta},$$   (ii)

where $h$ is the quantity defined in (6) of § 4, and where $\theta$, measured counter-clockwise
from the direction of $x$, is the angle made with that direction by the contour $\beta$ = const.,
so that

$$0 = \cos\theta\,\frac{\partial\beta}{\partial x} + \sin\theta\,\frac{\partial\beta}{\partial y}, \qquad h = \frac{\partial\alpha}{\partial s} = \cos\theta\,\frac{\partial\beta}{\partial y} - \sin\theta\,\frac{\partial\beta}{\partial x}.$$   (iii)

Accordingly $he^{-i\theta}$, and therefore $(\log h - i\theta)$, are functions of $x + iy$ and of $\alpha + i\beta$.
Consequently

$$\frac{\partial\theta}{\partial\alpha} = \frac{\partial}{\partial\beta}(\log h), \qquad \frac{\partial\theta}{\partial\beta} = -\frac{\partial}{\partial\alpha}(\log h),$$   (27)

and from these relations, given $\log h$, we can determine $\theta$ (its conjugate) by the
methods of Part V (Gandy & Southwell 1940). Then, finally, $x$ and $y$ may be deduced
by integration from

$$\frac{\partial x}{\partial\alpha} = \frac{\partial y}{\partial\beta} = \frac{\cos\theta}{h}, \qquad -\frac{\partial x}{\partial\beta} = \frac{\partial y}{\partial\alpha} = \frac{\sin\theta}{h},$$   (28)

which are consequences of (i) and (ii). (Where $\theta$ is small, so that $\cos\theta = 1$, the first
of (28) implies that $h = \partial\beta/\partial y$, as assumed in § 10.)

Figure 5a. $h$, $\chi$ and $\psi$ for limiting subsonic regime, with contours of constant $h$.

The form of nozzle thus derived from the $h$-values recorded in figures 5 is
practically indistinguishable from Green’s form (figure 1). To fix the value of $T'/T$, L.F.
computed the integral $\int_0^{\beta_1} h^{-1}\,d\beta$ (§ 10) for three sections close to the section marked
‘throat’ in figures 2. Table 1 records his results, from which, by normal methods of
interpolation, it was deduced (i) that $\alpha = 28.56 \times 10^4$ for the true throat section,
where $T'/T$ as given by (26) has its minimum value; (ii) that this minimum value
is 1.00189.

Table 1

α × 10⁻⁴          T′/T
26                1.00612
28 (‘throat’)     1.00209
30                1.00324


Substituting this value for $T'/T$ in (25), we have according to Osborne Reynolds’s
approximate treatment

$$\mu_{\rm crit} = (5/6)^3 \times 1.00189 = 0.57980.$$   (29)
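The figures quoted from table 1 can be reproduced by passing a parabola through the three tabulated points. The paper speaks only of ‘normal methods of interpolation’, so the three-point parabola used below is an assumption, and the function name is hypothetical.

```python
def parabola_minimum(xs, ys):
    """Vertex of the parabola through three equally spaced points."""
    (x0, x1, x2), (y0, y1, y2) = xs, ys
    step = x1 - x0
    second_difference = y0 - 2.0 * y1 + y2
    # Offset of the vertex from the middle point, in units of the spacing.
    shift = -0.5 * (y2 - y0) / second_difference
    x_min = x1 + shift * step
    y_min = y1 - (y2 - y0) ** 2 / (8.0 * second_difference)
    return x_min, y_min

# Table 1: alpha x 10^-4 against the computed width ratio.
alpha_throat, width_ratio = parabola_minimum(
    (26.0, 28.0, 30.0), (1.00612, 1.00209, 1.00324)
)

# Reynolds's estimate corrected for the implied throat width, as in (25).
mu_reynolds = (5.0 / 6.0) ** 3 * width_ratio
```

Rounded to the figures carried in the text, this gives 28.56 for the throat position, 1.00189 for the minimum ratio and 0.57980 for the corrected estimate.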


II. Results of computation by the iterative method

13. With $h$-values as recorded in figures 5, computations on the lines of §§ 6-8
were made (again, by L.F. with assistance from Miss Gillian Vaisey) to determine
the value of $\mu_{\rm crit}$. The means to this determination were outlined in § 24* of the
earlier paper, which we now summarize.

For every ‘section’ (i.e. contour of $\alpha$ in figure 5b), the iterative method can be used
to construct a curve of the type of figure 4, in which $\mu$ ranges from 0 to some
maximum value $\mu_{\max}$. Only values less than $\mu_{\max}$ are admissible, and so (since in any
possible regime $\mu$ will have the same value for every ‘section’) the wanted (limiting)
value of $\mu$ is the least $\mu_{\max}$ which is discoverable. Clearly it will be the $\mu_{\max}$ for
some section near the throat, but it is not certain that this section will be one of those
that we investigate. However, having values of $\mu_{\max}$ for three or four values of
$\alpha$ near the throat, it is an easy matter to deduce (by customary interpolation
methods) the $\alpha$-value for which $\mu_{\max}$ has its minimum value, and thence to compute
this minimum $\mu_{\max}$. Then, accepting the computed value and reverting to the curves
(of type figure 4) which have been constructed for other sections, it is again an easy
matter to deduce for each of them a value of $A$ (§ 6), and with this to compute $\psi$
for the relevant section from (15).

Two such values of $A$ will be admissible, according as we are concerned with a
regime of the type of (a) or (b), § 1, i.e. with a subsonic or supersonic regime. Thus
in figure 4 (on its scale of $A\mu$)

$$OM = \mu A_{\rm sub.}, \qquad ON = \mu A_{\rm sup.},$$

$OM$ entailing subsonic and $ON$ supersonic velocities at the ‘section’ considered.
Upstream of the throat section we must postulate subsonic conditions: downstream
we may give to $A$ (at every section) its appropriate value either of $A_{\rm sub.}$ or $A_{\rm sup.}$.

Estimation of $\mu_{\rm crit}$ for subsonic regimes

14. Proceeding on these lines, L.F. first calculated to a first approximation (§ 7)
values of $\mu_{\max}$ for the section marked as ‘throat’ in figures 2, and for the adjoining
section on either side. His results are recorded in table 2: from them, by customary
methods, he deduced

$$\mu_{\rm crit} = 0.57960$$   (i)

as a first approximation to the critical mass flow, and established that the
true throat section lies slightly downstream of the central section in table 2
($\alpha = 28.55 \times 10^4$).


Table 2

α × 10⁻⁴ =               26          28 (‘throat’)    30
μ_max (1st approx.)      0.58211     0.57971          0.58041
μ_max (2nd approx.)      0.58207     0.57943          0.58030


He then proceeded to a second and third approximation in the manner of § 8.
Giving to $\mu$ his first approximation (i) to $\mu_{\rm crit}$, he deduced second approximations
to $\psi$ and to $\chi$ for the five central sections, and thence $F_1(\beta)$ for the three central
sections; then (giving to $\mu_1$, in (24), the same value 0.57960) he could obtain new
estimates of $\mu_{\max}$ for the three central sections, as recorded in the last line of table 2.
From these a second approximation to the critical mass flow could be deduced as
before, viz.

$$\mu_{\rm crit} = 0.57932.$$   (30)

The whole process was then repeated with this new value 0.57932, instead of
0.57960, given to $\mu$ and to $\mu_1$. No change resulted in the estimate of $\mu_{\rm crit}$, and
accordingly (30) was accepted.
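Both estimates of the critical mass flow follow closely from the rows of table 2 if the least value is found by quadratic interpolation. The sketch below is illustrative: the ‘customary methods’ are not specified in the paper, so the quadratic through three points (here in Newton's divided-difference form) is an assumption, and the function name is hypothetical.

```python
def quadratic_minimum(points):
    """Minimum of the quadratic through three (x, y) points."""
    (x0, y0), (x1, y1), (x2, y2) = points
    d01 = (y1 - y0) / (x1 - x0)
    d12 = (y2 - y1) / (x2 - x1)
    curvature = (d12 - d01) / (x2 - x0)  # second divided difference
    # Newton form: y = y0 + d01*(x - x0) + curvature*(x - x0)*(x - x1)
    x_min = 0.5 * (x0 + x1) - d01 / (2.0 * curvature)
    y_min = y0 + d01 * (x_min - x0) + curvature * (x_min - x0) * (x_min - x1)
    return x_min, y_min

# Table 2, with alpha in units of 10^4.
first_approx = [(26.0, 0.58211), (28.0, 0.57971), (30.0, 0.58041)]
second_approx = [(26.0, 0.58207), (28.0, 0.57943), (30.0, 0.58030)]

alpha_1, mu_crit_1 = quadratic_minimum(first_approx)
alpha_2, mu_crit_2 = quadratic_minimum(second_approx)
```

The two minima agree with the values (i) and (30) to within a unit in the fifth decimal place, and the first places the throat near 28.55, slightly downstream of the central section.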

In § 10* of the earlier paper it was shown that Osborne Reynolds’s estimate (29)
must be regarded as an upper limit to the correct value, i.e. (since it can hardly be
exact) as an overestimate. This expectation is realized, since our result (30) is 0.083 %
less.

15. Next, giving to $\mu$ its critical value as recorded in (30), L.F. proceeded to
determine (in accordance with § 13) a value of $A_{\rm sub.}$ for every section in the range of
$\alpha$ which is covered by figures 5. Outside that range new computations were not
needed, for the reason that $h$ is sensibly independent of $\beta$.

(On a section where $h$ is constant we have, from (14),

$$q = A\mu h = \text{constant},$$

therefore

$$\chi = \text{const.}, \ \text{according to (10)}.$$

Consequently (15) and the first of (16) simplify to

$$\psi = A\beta/\chi^2, \qquad 1 = A\beta_1/\chi^2,$$

whence

$$\psi = \beta/\beta_1,$$   (i)

as we should expect. Now replacing $A$ by $\chi^2/\beta_1$, we have from (14)

$$\mu h = \beta_1 q/\chi^2,$$   (ii)

in which $\mu$, $h$, $\beta_1$ are known. Consequently $q$ and $\chi$ can be determined separately
from a diagram of the type of figure 4 or (in our particular problem) from the equation

$$5\left(1 - \chi^{-0.8}\right) = q^2 = (\mu h/\beta_1)^2\chi^4,$$   (31)

which comes from (10) and (ii).

All of this is based upon the simple theory of §§ 6-7. But no correction comes from
the amended theory of § 8, because when $\psi$ is given by (i) then $\partial\psi/\partial\alpha = 0$ and
so $F_1(\beta) = 0$.)
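On such a section the mass flux $\rho q$ is fixed by $\mu$, $h$ and $\beta_1$, and the perfect-gas relation (10) then admits two solutions, one subsonic and one supersonic. The sketch below finds both by bisection; it is an editorial illustration assuming $\gamma = 1.4$ and $\rho = \chi^{-2}$, with hypothetical function names and a hypothetical flux of 0.5.

```python
import math

MU_LIMIT = (5.0 / 6.0) ** 3     # greatest possible value of rho*q
RHO_SONIC = (5.0 / 6.0) ** 2.5  # density at which q equals the local sound speed

def mass_flux(rho):
    """rho*q from (rho*q)**2 = 5 * rho**2 * (1 - rho**0.4)."""
    return rho * math.sqrt(5.0 * (1.0 - rho ** 0.4))

def bisect(f, lo, hi, iterations=200):
    """Plain bisection; assumes f(lo) and f(hi) have opposite signs."""
    f_lo = f(lo)
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        f_mid = f(mid)
        if (f_mid > 0.0) == (f_lo > 0.0):
            lo, f_lo = mid, f_mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def regimes(flux):
    """Return ((rho, q) subsonic, (rho, q) supersonic) for a given rho*q."""
    if not 0.0 < flux < MU_LIMIT:
        raise ValueError("flux must lie strictly between 0 and (5/6)**3")
    residual = lambda rho: mass_flux(rho) - flux
    rho_sub = bisect(residual, RHO_SONIC, 1.0)  # denser, slower branch
    rho_sup = bisect(residual, 0.0, RHO_SONIC)  # rarer, faster branch
    return (rho_sub, flux / rho_sub), (rho_sup, flux / rho_sup)

subsonic, supersonic = regimes(0.5)  # hypothetical value of mu*h/beta_1
```

The local speed of sound, in units of $a_0$, is $\rho^{0.2}$ on these assumptions, so each branch can be classified by comparing $q$ with that quantity.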

Determination of the critical subsonic regime 

16. The resulting ‘subsonic’ regime is exhibited in figure 6, which compares with
figures 8-11 of the earlier paper. Results were found to converge rapidly, no more
than three cycles of the iterative process being required at any section.



A feature of the solution is the attainment of velocities exceeding the speed of
sound in a localized region (shaded in figure 6) near the walls of the throat section.
This has importance in view of a remark by von Kármán (1941): ‘It appears that
in the case of flows that are partially subsonic, partially supersonic, the existence
of continuous velocity fields is one of the questions of basic importance.’




Estimation of $\mu_{\rm crit}$ for the supersonic regime

17. We turn to the more difficult problem of the supersonic regime, which on our
assumptions will be (as in Osborne Reynolds’s simple treatment) unique. (In practice,
of course, we must contemplate that our nozzle discharges into a reservoir
maintained at any constant pressure, and a question thus arises as to what happens
if this pressure is different from what our unique solution requires. That question,
however, arises also in relation to Reynolds’s treatment,† and it introduces physical
considerations with which a computational paper has no concern.)

The first point to be noticed is that $\mu_{\rm crit}$, determined in the manner of § 14, has a
different value according as the velocity is subsonic or supersonic downstream of the
throat. The simple treatment of § 7, of course, makes the subsonic and supersonic
values identical, and a like result (with a slight difference in the numerical value)
follows from Reynolds’s treatment; but the quantity $F_1(\beta)$, which enters into the
more exact theory of § 8, is different in the two regimes, and this difference is reflected
in the value of $\mu_{\rm crit}$. Proceeding in the manner of § 14, L.F. obtained supersonic
values as recorded in table 3, which compares with table 2 for the subsonic regime.
The throat (defined as that ‘section’ for which $\mu_{\max}$ has its least value) was found to
be given by

$$\alpha = 28.73 \times 10^4,$$   (32)

i.e. to be slightly further downstream than before; and the accepted estimate of the
critical mass flow was that given by

$$\mu_{\rm crit} = 0.57958,$$   (33)

which is higher by 0.045 % than the subsonic value (30).
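The two percentages quoted in the paper follow from the three estimates (29), (30) and (33) by simple arithmetic, as the following check shows. The variable and function names are hypothetical; the numerical values are those printed in the text.

```python
mu_reynolds = 0.57980    # Reynolds's estimate, equation (29)
mu_subsonic = 0.57932    # limiting subsonic value, equation (30)
mu_supersonic = 0.57958  # supersonic value, equation (33)

def percent_excess(value, reference):
    """Amount by which value exceeds reference, as a percentage of reference."""
    return 100.0 * (value - reference) / reference

reynolds_overestimate = percent_excess(mu_reynolds, mu_subsonic)
supersonic_excess = percent_excess(mu_supersonic, mu_subsonic)
```

To three decimal places these come to 0.083 and 0.045 respectively, the subsonic value being taken as the reference in both cases.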


Table 3

α × 10⁻⁴                 26          28          30
μ_max (1st approx.)      0.58211     0.57971     0.58041
μ_max (2nd approx.)      0.58203     0.57968     0.58037


Determination of the critical supersonic regime 

18. Accepting this value, L.F. proceeded to compute for all ‘sections’ of figure 5b
the supersonic distributions of $\psi$ and $\chi$, with results which are shown in figure 7.
Difficulties were confronted which had not been met in our subsonic computations,
in that the iterative process was found to diverge. There appeared to be a non-zero
lower limit to the algebraic sum of the nodal values of $F_1(\beta)$, which limit was attained,
in general, after two cycles of the process: thereafter the sum increased, much in the
way that ‘residual forces’ increase when normal relaxation methods are employed
on eigenwerte problems with an incorrect value for the characteristic number. In
our accepted results, nodal values of $F_1(\beta)$ were ‘stable’ (i.e. did not alter in successive
stages) to one part in a thousand of the values of $\psi$ and $\chi$, and these had like accuracy
in the sense that a change of that order made matters worse. Higher accuracy seemed
unattainable, and the impression was gained that although the boundary $h$’s had
been ‘smoothed’ as far as was practicable, discontinuities remaining in the implied
nozzle boundary might be the cause of disturbances large enough to prevent the
finding of an unstable regime.

† Cf. von Kármán 1941, p. 340.

The subsonic regime appeared to be, in comparison, highly stable. Quite large
$F_1(\beta)$’s were manageable, and values of opposite sign on adjacent strips tended to
ultimate ‘stability’, whereas the opposite was found in the supersonic solution.
Our subsonic solution (figure 6) is believed to be accurate within one part in ten
thousand.



CONCLUSION 

19. Our conclusions are as stated in § 3, with the addition that irrotational flow
(so far as can be judged from the convergence of our iterative process) is stable from
low velocities up to those entailed in our ‘limiting subsonic regime’, but is unstable
(though possible, according to the governing equations) in the unique supersonic
regime.

It may now be said with confidence that any shape of nozzle can be treated in two
dimensions, and with slight modification our method could be applied to nozzles
having axial symmetry. The labour entailed would of course be somewhat greater.

Grateful acknowledgment is made of grants from the Ministry of Aircraft 
Production and from the Department of Scientific and Industrial Research which 
permitted, respectively, L. F. to make and Miss Vaisey to assist in the computations
here described.





References 

Gandy, R. W. G. & Southwell, R. V. 1940 (‘Part V’) Phil. Trans. A, 238, 453.
Green, J. R. & Southwell, R. V. 1944 (‘Part IX’) Phil. Trans. A, 239, 367.
von Kármán 1941 J. Aero. Sci. 8, 337.
Reynolds, O. 1886 Phil. Mag. 21, 185.
Taylor, G. I. & Sharman, C. F. 1928 Proc. Roy. Soc. A, 121, 194.


The production of cosmic-ray bursts by mesotrons

By C. B. O. Mohr, Ph.D., Lecturer in Physics, and G. H. Stafford, M.Sc.,

University of Cape Town

(Communicated by H. S. W. Massey, F.R.S. Received 26 August 1943)

The frequency of occurrence of cosmic-ray bursts under large thicknesses of iron and lead has 
been investigated as a function of the thickness. A definite maximum is found to occur in 
the curves for the smaller bursts, its prominence diminishing with bursts of increasing size. 

The dependence of the burst frequency on the size of the ionization chamber and the 
proximity of the shielding material to the chamber has also been investigated. The results 
suggest the production of showers of mesotrons which spread out, and ultimately give rise to 
electron showers originating in widely separated areas in the material above, and round the 
sides of, the ionization chamber. 

It is now well known that the cosmic radiation reaching the earth’s surface
comprises two main ionizing components, a soft component consisting of fast electrons
and a hard component consisting of mesotrons.

The fast electron component is absorbed in a few centimetres of heavy material
as the result of a cascade process in which each fast electron gives rise to a shower
of soft electrons. The frequency of showers of a given size rises to a maximum and
then falls again as the thickness of the material is increased, giving the familiar
‘Rossi curve’. The processes involved have been fairly extensively investigated
with various arrangements of coincidence counters, and with ionization chambers
which record bursts of ionization produced by showers, and the main features of
the observations have been satisfactorily accounted for by the quantum theory
(Froman & Stearns 1938).

The Rossi curve, after an initial fall beyond the maximum, flattens out and
then falls off much more slowly, indicating the presence of a harder component,
the mesotron component, capable of penetrating several metres of heavy material.
The occurrence of large cosmic-ray bursts under great thicknesses of matter has been
ascribed to the following process (Christy & Kusaka 1941). The mesotrons produce
fast electrons, either (a) in head-on collisions with atomic electrons, or (b) through
the intermediary of hard quanta produced by retardation of the mesotrons in
nuclear collisions (‘Bremsstrahlung’), these quanta subsequently giving rise to
fast electron pairs. The fast electrons then give rise to showers of slower electrons
by cascade.




It was thought for some time that for these larger thicknesses, corresponding to
the flatter part of the Rossi curve, equilibrium between mesotrons and electrons
was reached, and that the showers of electrons formed did not extend over large
areas. Certain observers have, however, obtained indications of the occurrence of
more complicated processes than those mentioned above. Some, using coincidence
counters, have found traces of a second maximum in the Rossi curve at about 30 cm.
of iron and about 17 cm. of lead, though others have been unable to confirm this
result (for a list of references see Nereson 1942). Swann & Ramsay (1940a), using
large numbers of self-recording counters arranged in layers, one under the other,
have determined the rate of occurrence of two-particle showers under different
thicknesses of tin, and found a second maximum in their curve. Broussard &
Graves (1941), using an expansion chamber, have investigated the frequency of
occurrence of two-particle showers in lead, and obtained a maximum frequency
under 17 cm.

The suggestion (Bhabha 1938; Heitler 1938) that the second maximum in the
Rossi curve is due to the production of mesotrons in the material receives support
from more recent calculations (Heitler 1941; Wilson 1941; Heitler & Peng 1942),
which have shown that there is an appreciable probability of the production of
mesotron showers by sufficiently energetic primary mesotrons. That such multiple
processes involving mesotron creation do occur is now confirmed by a number of
experiments (see Discussion).

We have observed the frequency of occurrence of bursts under thicknesses of 
iron ranging from 14 to 90 cm., and of lead ranging from 10 to 40 cm., and found 
a pronounced maximum which, it would seem, arises from the production of 
mesotron showers. We have also observed the burst frequency using two ionization 
chambers of different sizes as well as different arrangements of the larger chamber 
and shielding material, and obtained somewhat unexpected results. A mechanism 
is proposed for the explanation of these results as well, in terms of the production 
of mesotron showers in the material. 

Apparatus 

Most of the measurements were carried out with a large cylindrical steel ionization
chamber of wall thickness 0.3 cm. containing air at a total pressure of 7.2 atm.
The electrodes were in the form of concentric cylinders of length 57.5 cm., and
diameter 22.3 and 5.0 cm. respectively, and consisted of two iron wire grids
soldered to iron-rod frameworks with zinc to minimize the 'natural effect'. The
chamber was assembled and used in a laboratory which has always been free of
radioactive contamination. Some measurements were also taken with a small
duralumin chamber of wall thickness 1 cm. containing air at a total pressure of
90 atm. The electrodes, of spun aluminium, were approximately in the form of
concentric cylinders of mean diameter 10.0 and 4.0 cm. respectively with
hemispherical ends.




For the first runs the high potential for the electrodes was obtained from 704 
small dry cells, arranged in eleven boxes placed one above the other and carefully 
insulated from each other. After some weeks of hot weather these batteries began 
to dry out, and gave fluctuations of voltage which, though relatively small, were 
for our purpose hopelessly big. It has previously been pointed out (Carmichael
1936) that in this work sudden fluctuations in the potential of more than 1 part in
10⁶ must be avoided. The dry batteries were replaced by 30 new Exide 10 V
high-tension units, and these have continued to give the necessary steadiness. While the
smaller potential resulted in a longer time of collection of the ions, this should not
affect the variation of the burst frequency with thickness of material above the
chamber.

The inner cylindrical electrode was connected directly to the control grid of an 
electrometer valve set up in a balanced circuit (du Bridge 1933), and changes in
the steady plate current were detected with a Moll galvanometer of period 0.2 sec.
The galvanometer was shunted with a resistance of 90 ohms to damp irregular
oscillations of the mirror. The motion of the spot of light reflected from the
galvanometer mirror was recorded photographically on a sheet of Insurance Bromide,
Grade A, wrapped round a cylindrical drum driven by a small synchronous motor,
the drum moving laterally as it rotated on its axis. This camera was designed and
constructed by Mr J. A. Linton, of the workshop staff. After considerable experience
had been gained in identifying very small kicks due to bursts, the drum was
finally geared down to give a film speed of 7 mm./min., and a single sheet of
bromide paper then sufficed for a 24 hr. run. A high-resistance leak was connected 
between the control grid and the filament of the valve, so that after the sudden 
kick produced by a burst, the trace on the film returned to the zero line with a 
half-time of about 8 sec. The kicks on the film were subsequently measured with a 
travelling microscope carrying a vernier reading to 0.01 mm., and kicks smaller
than 0.40 mm. were rejected as falling below the limit of certain detection. While
the counts may have included small spurious kicks due to statistical fluctuations
or to groups of contamination alpha-particles, it will be clear from the nature of
the results obtained that any such kicks formed a relatively unimportant
background of constant magnitude, on which were superposed larger real effects of
particular interest. 

In order to estimate the sensitivity of the apparatus to shower particles, certain 
constants of the apparatus had to be determined. The capacities of the large and 
of the small chamber, together with that of the shielded lead to the grid of the 
electrometer valve, were found to be 27 and 15 cm. respectively. The mean path 
length of a cosmic-ray particle through each chamber was calculated with the aid 
of a simple mechanical model: this model permitted the direct measurement of the 
path length for a large number of regularly spaced angles of incidence and points 
of entry of the ray into the chamber, and each value was suitably weighted
according to geometrical considerations and to allow for the drop in number of the
shower particles with increasing angle of divergence from the vertical (Froman &
Stearns 1938). The mean path length was found to be 25.2 and 9.7 cm. for the large
and small chambers respectively when placed with their axes of symmetry vertical,
and 17.0 cm. for the large chamber when placed with its axis horizontal. From
these data a fairly accurate comparison of the sensitivities of the two chambers 
could be made. 

Only the order of magnitude of the absolute sensitivities of the chambers could, 
however, be determined. In the first place the value of the mean specific ionization 
by cosmic-ray particles is somewhat uncertain: a mean value of 60 ion-pairs/cm. 
path in air at n.t.p. was adopted. In the second place it was not found easy to 
estimate the factor introduced by the amplifying system. The method first employed 
was to find the deflexion produced on the film by applying a given steady potential 
to the control grid of the electrometer valve (Mohr & Stafford 1942), but this
method was later found to be unsatisfactory, probably because it does not closely
reproduce the experimental conditions which exist when the amplifier is in
operation with the ionization chamber. A more reliable method was adopted, which
consisted in finding the deflexion produced on the film (actually 13 mm.) when
a small potential was induced on the collecting electrode of the small chamber as
a result of suddenly altering the potential applied to the chamber electrodes by
0.1 V. Account was taken of the fact that the capacity of the shielded lead from the
chamber to the valve was approximately equal to the capacity of the small chamber,
and that on the average about one-fifth of the total charge collected by the chamber
electrode has leaked away through the grid leak by the time the collection of the
ions is complete. It was then calculated that a 1 mm. kick corresponds
approximately to a burst of 90 particles traversing the large chamber, or 10 particles in the
small chamber, when the axis of each chamber is vertical. 
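
The order-of-magnitude estimate above can be retraced numerically. The following is only a sketch and not necessarily the calculation the authors made: it assumes that the ionization per cm. of path scales linearly with the air pressure, that the 0.1 V step induces half that potential on the collecting electrode (the chamber and lead capacities being taken as equal), and that the capacities quoted in cm. are electrostatic units (1 cm. is about 1.113 pF). The function and variable names are illustrative.

```python
# Rough check of the quoted sensitivities (about 90 and 10 particles per mm. kick).
# Assumptions (not stated in this form in the text): ionization scales linearly
# with pressure; the induced potential is half the applied 0.1 V step;
# capacities given in cm. are electrostatic units.

ESU_CM_TO_FARAD = 1.11265e-12   # 1 cm. of capacity expressed in farads
ELECTRON_CHARGE = 1.602e-19     # coulombs
ION_PAIRS_PER_CM_NTP = 60.0     # mean specific ionization adopted in the text
LEAK_FRACTION = 0.2             # about one-fifth of the charge leaks away
DEFLEXION_MM = 13.0             # deflexion observed for the calibration step
INDUCED_VOLTS = 0.1 * 0.5       # assumed potential induced on the collector


def particles_per_mm(capacity_cm, pressure_atm, mean_path_cm):
    """Number of shower particles giving a 1 mm. kick (order of magnitude)."""
    volts_per_mm = INDUCED_VOLTS / DEFLEXION_MM
    charge = capacity_cm * ESU_CM_TO_FARAD * volts_per_mm   # coulombs on the grid
    ion_pairs_on_grid = charge / ELECTRON_CHARGE
    # Correct for the charge lost through the grid leak during collection.
    ion_pairs_collected = ion_pairs_on_grid / (1.0 - LEAK_FRACTION)
    ion_pairs_per_particle = ION_PAIRS_PER_CM_NTP * pressure_atm * mean_path_cm
    return ion_pairs_collected / ion_pairs_per_particle


large = particles_per_mm(capacity_cm=27.0, pressure_atm=7.2, mean_path_cm=25.2)
small = particles_per_mm(capacity_cm=15.0, pressure_atm=90.0, mean_path_cm=9.7)
print(f"large chamber: about {large:.0f} particles per mm.")
print(f"small chamber: about {small:.0f} particles per mm.")
```

Under these assumptions the two figures come out close to the quoted values of 90 and 10, which is as much agreement as an order-of-magnitude estimate allows.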

When investigating bursts under thick layers of iron or lead, a square thick- 
walled enclosure of the same material was built up round the sides of the chamber 
to prevent electron showers from the walls and ceiling of the room reaching the 
chamber. The layer of material directly above the chamber overlapped the walls 
of the enclosure by a few cm. Since the precise geometrical arrangement was 
found to be of some importance, that used in most of the runs with iron is
illustrated in figure 1. The arrangement of the lead was similar, except that the
thickness of the enclosure walls was 10 cm. instead of 14 cm. as for iron. 


Results 

The results for the rate of occurrence of bursts of different sizes under different 
conditions are given in table 1. Their most interesting feature is found to lie in the 
manner in which the rate varies with the thickness of iron or lead above the 
chamber. This variation is shown in figures 2 and 3. 

Table 1. Total number of bursts corresponding to kicks of different sizes

L or S denotes whether the large or small chamber was used;
V or H whether it was placed with its axis vertical or horizontal.
A hyphen denotes an empty entry.

Type of chamber                               L    L    L    L    L    L    L    L    L    L    S    L    L    L    L
Axis of chamber                               V    V    V    V    V    V    V    V    V    V    V    V    H    H    H
Shielding material                           Pb   Pb   Pb   Pb   Fe   Fe   Fe   Fe   Fe   Fe   Fe   Fe   Fe   Fe   Fe
Cross-section of enclosure (cm.²)           730  730  730  730  730  730  730  730  730  730  400 3500 2020 2020 2020
Thickness of material above chamber (cm.)    10   20   30   40   14   30   46   60   76   90   46   14   14   30   46
Time (hr.)                                  234  135  142  113  131  134  134   99  106   78  507  103   47   74   59

No. of kicks between
  0.4-0.5 mm.                               114  100   93   55   93   99   86   97  122   85  116   51  102  101   72
  0.5-0.6 mm.                                73   74   66   38   41   55   44   57   72   68   90   30   46   34   23
  0.6-0.7 mm.                                46   35   36   28   25   21   34   24   55   18   53    5   21   10   16
  0.7-0.8 mm.                                29   21   26   18   20   18   24   18   31   14   43    6   16    9    6
  0.8-1.0 mm.                                44   34   33   22   15   24   28   23   29   13   37    6   15    6    3
  1.0-1.2 mm.                                20   16   14   15   15   11   19    7    9   13   37    -    8    5    3
  1.2-1.4 mm.                                16   12   15    8   10   10    6    5    5    9   28    1    3    -    2
  1.4-1.8 mm.                                21   11    9   11    6   10   10   11   11    3   19    -    1    3    1
  1.8-2.2 mm.                                 6    9    6    3    4    3    7    7    3    4    9    -    1    1    1
  2.2-2.6 mm.                                 9    2    7    3    2    5    -    3    5    2    7    -    -    1    -
  2.6-3.0 mm.                                 4    4    5    2    3    5    3    1    3    1    4    -    1    -    1
  3.0-4.0 mm.                                 4    7    3    4    3    1    -    1    3    3    6    -    2    1    1
  4.0-5.0 mm.†                                3    1    1    2    -    -    1    -    2    2    2    -    -    1    -
  5.0-6.0 mm.†                                2    2    1    1    -    -    -    -    1    1    1    -    -    -    -
  6.0-8.0 mm.†                                2    1    1    1    2    -    -    1    -    -    2    -    2    1    -
  8.0-10.0 mm.†                               -    1    -    -    -    -    -    -    -    -    1    -    -    -    -
  Over 10.0 mm.                               2    -    1    -    1    2    2    -    -    -    1    -    1    -    1

† Some entries in these rows are illegible in the available copy; the column assignment of the entries beyond the seventh column is uncertain.




Figure 2. Rate of occurrence of bursts in large chamber under iron.
(The limits of error refer to the standard deviation.)
[Axes: number of kicks per hour; cm. iron above chamber.]

Figure 3. Rate of occurrence of bursts in large chamber under lead.
(The limits of error refer to the standard deviation.)

It will be seen that pronounced maxima occur for the smallest bursts at about 
20 cm. for lead and 75 cm. for iron, as against the values 17 and 30 cm. respectively 
at which the second maximum is found to occur in the Rossi curves obtained with 
counters. The statistical error of the present results is not small enough to make it 
certain that there is not a slight maximum at 30 cm. in our curves for iron in 
addition to the maximum at 75 cm. In this connexion it would have been of
interest to have extended the curve for lead beyond 40 cm. to see whether there 
was not a further maximum, and it is unfortunate that it was not possible to do 
this with the amount of lead readily available to us. 
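
The positions of the maxima can be checked directly against the first row of counts in table 1. The sketch below is illustrative only: it takes the numbers of 0.4-0.5 mm. kicks and the running times for the large chamber with its axis vertical, and assumes Poisson statistics, so that the standard deviation of a count N is taken as the square root of N. The names used are hypothetical.

```python
import math

# (thickness in cm., number of 0.4-0.5 mm. kicks, running time in hr.)
# taken from table 1 for the large chamber with its axis vertical.
LEAD = [(10, 114, 234), (20, 100, 135), (30, 93, 142), (40, 55, 113)]
IRON = [(14, 93, 131), (30, 99, 134), (46, 86, 134),
        (60, 97, 99), (76, 122, 106), (90, 85, 78)]


def rates(runs):
    """Return (thickness, kicks per hour, standard deviation) for each run."""
    out = []
    for thickness, count, hours in runs:
        rate = count / hours
        sigma = math.sqrt(count) / hours   # assumed Poisson error on the count
        out.append((thickness, rate, sigma))
    return out


def thickness_of_maximum(runs):
    """Thickness at which the tabulated rate is greatest."""
    return max(rates(runs), key=lambda row: row[1])[0]


for name, runs in (("lead", LEAD), ("iron", IRON)):
    for thickness, rate, sigma in rates(runs):
        print(f"{name} {thickness:3d} cm.: {rate:.2f} +/- {sigma:.2f} per hr.")
    print(f"{name}: highest rate at {thickness_of_maximum(runs)} cm.")
```

The tabulated points place the highest rates at 20 cm. of lead and 76 cm. of iron, in keeping with the values read from the curves.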

While the maxima in our curves are presumably of the same nature as those 
found for two-particle showers by Swann & Ramsay (1940a) and by Broussard &
Graves (1941), our curves for kicks of 0.4-0.5 mm. correspond on the average to
the passage of about 40 particles through the chamber. 

Light was thrown on this point by taking runs with the smaller chamber under 
46 cm. of iron and comparing the burst rates for the large and the small chamber. 
The comparison is best effected by plotting, as is usual in this work, the logarithm 
of the total number of kicks greater than a certain size against the logarithm of the
size of the kick (figure 4). The data show that a kick of 0.4 mm. with the large
chamber corresponds on the average to a burst comprising the same number of
particles crossing the chamber as a kick of 3.5 mm. with the smaller chamber. We
then find from figure 4 that kicks of over 0.4 mm. with the large chamber are about
100 times as frequent as kicks of over 3.5 mm. with the small chamber, whereas
the ratio of the cross-sections of the chambers is only about 5. Hence the usual 
assumption that the burst rate under large thicknesses is proportional to the cross- 
section of the chamber, which is based in turn on the assumption that the bursts 
are due to fairly narrow showers diverging from a comparatively small area in the 
material above the chamber, is certainly not justified. The greatest caution is 
therefore necessary in comparing experimental data on the absolute frequency of 
large bursts with theoretical values, in attempts to determine the spin or other 
properties of the mesotron (Christy & Kusaka 1941).
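
The integral distribution plotted in a diagram such as figure 4 is obtained by summing a column of table 1 from the largest kicks downwards. A minimal sketch for the large chamber with its axis vertical under 46 cm. of iron follows; the size classes and counts are those of table 1, empty entries are taken as zero, and the helper name is hypothetical. The small-chamber column is left out here.

```python
import math

# Lower edge (mm.) of each size class in table 1; the last class is "over 10.0 mm."
LOWER_EDGES = [0.4, 0.5, 0.6, 0.7, 0.8, 1.0, 1.2, 1.4, 1.8,
               2.2, 2.6, 3.0, 4.0, 5.0, 6.0, 8.0, 10.0]
# Large chamber, axis vertical, 46 cm. of iron, 134 hr. (empty entries as 0).
COUNTS = [86, 44, 34, 24, 28, 19, 6, 10, 7, 0, 3, 0, 1, 0, 0, 0, 2]
HOURS = 134.0


def integral_spectrum(edges, counts, hours):
    """Rate per hour of kicks larger than each lower edge."""
    spectrum = []
    running_total = 0
    # Accumulate from the largest size class down to the smallest.
    for edge, count in zip(reversed(edges), reversed(counts)):
        running_total += count
        spectrum.append((edge, running_total / hours))
    spectrum.reverse()
    return spectrum


for edge, rate in integral_spectrum(LOWER_EDGES, COUNTS, HOURS):
    print(f"> {edge:4.1f} mm.: {rate:.3f} per hr. "
          f"(log size {math.log10(edge):+.2f}, log rate {math.log10(rate):+.2f})")
```

Plotting the logarithm of the rate against the logarithm of the size for this column gives points of the kind shown for the large chamber.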

Turning again to figure 4, it will be seen that while most of the points lie
approximately on a straight line for each chamber, deviation from linearity occurs for the
smallest bursts, as shown by the dotted lines. Such deviations are to be expected
in view of the anomalous variation of burst rate with thickness for the smallest
bursts. Furthermore, it is reasonable to expect that the point at which the deviation
begins should in each case correspond to a shower comprising, on the average, the
same total number of particles in all, N (say), but that the proportion of the rays
entering the chamber will depend on the size of the latter, if the shower extends
over a region comparable with the cross-section of the chamber. The bend in the
curve for the large chamber occurs for a kick of 0.7 mm., corresponding to the
passage of about 60 rays through the chamber, while for the small chamber the
bend occurs for a kick of 1.0 mm. corresponding to the passage of about 10 particles
through the chamber. This indicates that when an N-particle shower takes place
above either chamber, approximately six times as many of the shower particles
pass through the large chamber as through the small, and since the ratio of the
cross-sections of the space enclosed between the electrodes is 5.4, it suggests that
the shower particles extend over an area at least as large as that of the circular
section of the large chamber.

Considering now the observed frequency of occurrence of bursts due to these 
N-particle showers, figure 4 shows this to be 0.80 per hour for the large chamber
and 0.22 per hour for the small chamber, or a ratio of 3.6. This value lies between
the ratio, 5.4, of the cross-sections of the space between the electrodes for each
chamber, and the ratio, 1.8, of the areas enclosed by the iron surrounding the
chambers in each case, which is a reasonable result. 
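
The comparison of ratios made in the last two paragraphs can be set out as simple arithmetic. The sketch uses only figures quoted in the text and in table 1 (sensitivities of 90 and 10 particles per mm., bends at kicks of 0.7 and 1.0 mm., rates of 0.80 and 0.22 per hour, enclosure cross-sections of 730 and 400 cm.²); the variable names are illustrative.

```python
# Particles crossing each chamber at the bend in its curve (figure 4).
large_particles = 0.7 * 90    # kick size (mm.) x particles per mm., large chamber
small_particles = 1.0 * 10    # the same for the small chamber
particle_ratio = large_particles / small_particles

# Observed rates (per hour) of bursts due to the N-particle showers.
rate_ratio = 0.80 / 0.22

# Geometrical ratios with which the above are compared.
electrode_area_ratio = 5.4            # space between the electrodes
enclosure_area_ratio = 730 / 400      # iron enclosures, from table 1

print(f"particles, large/small : {particle_ratio:.1f}")
print(f"burst rate, large/small: {rate_ratio:.1f}")
print(f"enclosure areas        : {enclosure_area_ratio:.1f}")
print("rate ratio lies between the two area ratios:",
      enclosure_area_ratio < rate_ratio < electrode_area_ratio)
```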



Figure 4. Burst production in chambers of different sizes under 46 cm. iron. Sensitivity of
large chamber, 90 particles per mm. kick; of small chamber, 10 particles per mm. kick.


The results, then, can be considered as due to burst particles originating, not 
just from one small region in the material above the chamber, but from different 
regions in this material and probably also in the material round the sides of the 
chamber. The part played by the walls was investigated by increasing the horizontal 
cross-section of the iron enclosure round the large chamber to five times the value 
previously used, and at the same time increasing the area of the layer of iron above 
the chamber until it overlapped the walls of the enclosure, while keeping the 
thickness of the enclosure walls and the height of the material above the chamber
the same. It was found that the frequency of occurrence of the smallest bursts was
reduced to about one-half, while that of the larger bursts was reduced to a much 
smaller fraction, only one of the 99 kicks recorded being over 1 mm. in size. 

It is obvious, therefore, that the walls of the smaller enclosure scatter many of 
the shower particles back into the chamber, the more so since the increased burst 
rate due to the greater proximity of the walls more than counterbalances the 
decrease in frequency to be expected from the smaller area of burst-producing 
material above the chamber. As for the largest bursts, their number is so greatly 
increased by the proximity of the walls, that the particles incident on the enclosure 
walls must surely be producing a number of secondary particles in the latter. The 
process may be regarded as one of the whole chain of events starting with a single 
mesotron and ending with burst particles, so that the material of the enclosure 
walls—just as much as the material above the chamber—may be considered as a 
region from which some of the groups of shower particles originate. Further 
evidence for the effect of the size of the enclosure on the burst rate is provided by 
a comparison of the burst rates with the large chamber and with the small chamber 
under 46 cm. of iron. The somewhat smaller enclosure used with the small chamber 
is seen from figure 4 to result in a less rapid decrease in the burst rate with
increasing size of burst than with the large chamber.

The geometrical arrangement of the large chamber with its surrounding material 
was finally altered by placing the chamber with its axis horizontal, surrounding 
the sides and ends with the smallest possible rectangular enclosure of the same wall 
thickness, and placing above it thicknesses of 14, 30, and 46 cm. of iron in turn. 
Again a much more rapid decrease in the frequency of occurrence of the larger 
bursts with increasing size of burst was observed than in the case of the upright 
chamber with its smaller enclosure. It was also seen that the burst frequency— 
thickness curves were similar in form between 30 and 46 cm. (though relatively 
higher at 14 cm.) to those obtained for kicks of the same size when the chamber 
was erect. This indicates that although the mean path length of particles passing 
through the chamber is reduced by a factor of 0.7 when the latter is placed on its
side, approximately the same total ionization is observed with either position of 
the chamber for showers comprising the same total number of particles, as is to be 
expected if the shower particles extend over a relatively large area. 

Discussion 

The appearance of a maximum in our curves at large thicknesses is a phenomenon 
which must be closely related to that of the second maximum observed by Swann 
& Ramsay (1940a) using trays of counters for two-particle showers in tin, by
Broussard & Graves (1941) using a cloud chamber for two-particle showers in lead,
and by many others with coincidence counters. There seems little doubt now that 
these results are all due to multiple processes involving the production of mesotron 
showers. 




Heitler & Peng (1942) have shown that the probabilities of the production of
showers of two or more mesotrons by a primary mesotron reach a maximum for
certain definite values of the energy of the primary mesotrons; these values of the
energy required to produce a maximum for showers of n particles increasing with n.
The existence of mesotron showers has now been demonstrated by several
experiments. Schein, Jesse & Wollan (1941), and Carlson & Schein (1941) have put
forward evidence for the creation of mesotrons in multiples in the upper atmosphere.
Swann & Ramsay (1940b), Damy de Souza Santos, Pompeia & Wataghin (1941)
and Janossy (1942) have demonstrated with counters the production of mesotron
showers in certain elements. Powell (1941) and Wollan (1941) have obtained
cloud-chamber photographs showing the production of showers of several mesotrons in
lead.*

The mesotrons composing such showers will give rise to fast electrons at some 
point in their path, and these fast electrons will in turn produce by the usual 
cascade process the electron showers which are observed as bursts under large 
thicknesses. Since the mesotron shower particles may diverge widely, they may 
give rise to electron showers in widely separated regions above and around the sides 
of the ionization chamber. This mechanism would account for the extensive nature 
of at least some—especially the large—electron showers observed with the large 
chamber, while at the same time giving the peaks in the burst frequency—thickness 
curves as characteristic of showers composed originally of a few mesotrons. 

We have seen that the observed rate of production of bursts of different sizes 
depends very markedly on the geometrical arrangement of the chamber and the 
material surrounding it. The same geometrical dependence characterizes the results 
of some of the observers who have investigated the second maximum of the Rossi 
curve with counters (e.g. Schmeiser & Bothe 1938). To what extent the prominence
of the peaks in the burst frequency—thickness curves depends on the particular 
geometrical arrangement is still to be investigated. Further experiments on this 
and other points are being carried out with the small chamber by one of us 
(C. B. O. M.).

In conclusion we wish to thank Mr J. A. Linton and Mr G. Laing of the workshop
staff for setting up the pressure chambers and for much valuable assistance. We 
are also indebted to the University of Cape Town for a generous research grant for 
the purchase of apparatus, without which this research could not have been 
undertaken. 

* It has been suggested that these showers may be of the explosive type, but there is no 
conclusive evidence for this. 




References 


Bhabha 1938 Proc. Roy. Soc. A, 164, 257.
Broussard & Graves 1941 Phys. Rev. 60, 413.
Carlson & Schein 1941 Phys. Rev. 59, 840.
Carmichael 1936 Proc. Roy. Soc. A, 154, 223.
Christy & Kusaka 1941 Phys. Rev. 59, 414.
Damy de Souza Santos, Pompeia & Wataghin 1941 Phys. Rev. 59, 902.
du Bridge 1933 Rev. Sci. Instrum. 4, 532.
Froman & Stearns 1938 Rev. Mod. Phys. 10, 133.
Heitler 1938 Proc. Roy. Soc. A, 166, 529.
Heitler 1941 Proc. Camb. Phil. Soc. 37, 291.
Heitler & Peng 1942 Proc. Camb. Phil. Soc. 38, 296.
Janossy 1942 Proc. Roy. Soc. A, 179, 361.
Mohr & Stafford 1942 Nature, Lond., 149, 385.
Nereson 1942 Phys. Rev. 61, 111.
Powell 1941 Phys. Rev. 60, 413.
Schein, Jesse & Wollan 1941 Phys. Rev. 59, 615.
Schmeiser & Bothe 1938 Ann. Phys., Lpz., 32, 161.
Swann & Ramsay 1940a Phys. Rev. 58, 477.
Swann & Ramsay 1940b Phys. Rev. 57, 1051.
Wilson 1941 Proc. Camb. Phil. Soc. 37, 301.
Wollan 1941 Phys. Rev. 60, 532.


Applications of the photographic method to problems 
in nuclear physics 

II. The scattering of 8.8 and 13 MeV neutrons by protons

By F. C. Champion, King's College, University of London, and 
C. F. Powell, The H. H. Wills Physics Laboratory, University of Bristol 

(Communicated by A. M. Tyndall, F.R.S. Received 22 September 1943)


A method is described for determining the orientation of proton tracks in a photographic 
emulsion with an accuracy of ± 1°. This technical development increases the range of
application of the photographic method to problems in nuclear physics. The method has been
applied to an investigation of the scattering of 8.8 and 13.0 MeV neutrons by protons.

If I(θ) is the intensity per unit solid angle of the scattered neutrons in the centre of mass
co-ordinate system of the colliding particles, the measurements give values of I(θ) in the range
from θ = π to θ = ½π. When this range in the value of θ is divided into five equal intervals,
the observed numbers of tracks in the different intervals are consistent with the scattering
being isotropic in the centre of mass system to within the limits of the statistical error, for
the 8.8 MeV group of neutrons. There is some evidence for a periodic fluctuation of high order
in the angular distribution of the scattered intensity which remains to be confirmed. For the
13 MeV group the ratio of I(π)/I(½π) is found to be 0.67 ± 0.20.

The neutrons employed in this experiment were produced by bombarding boron with 
600 keV deuterons and arise from the reaction B¹¹(d, n)C¹². There is evidence that the excited
state of C¹² at 4.5 MeV is complex.





1. Introduction

The two principal objects of the present paper are, first, to give an account of the 
methods we have developed in the course of the past three or four years to determine 
accurately the spatial orientation of the track of a charged particle in a photographic 
emulsion; and, second, to describe the application of these methods to the
investigation of the scattering of fast neutrons by protons. Preliminary reports on the results
have already been published (Powell 1940; Powell, Heitler & Champion 1940).

Hitherto experiments with the photographic method have been for the most part 
confined to the measurement of the range of protons and a-particles, and for this 
purpose the tracks are arranged to be inclined at only small angles with the plane 
defined by the surface of the emulsion. The importance of the first of our objectives 
lies in the fact that if the angle of dip of a track into the emulsion can be determined 
with precision, the analogy of the method with the expansion chamber becomes 
more complete, and the scope of the method for experiments in nuclear physics is 
increased. In this paper we show that the angle of dip of a particle into the emulsion 
can be determined with an accuracy of about ± 1°.

The importance of the study of neutron-proton scattering at high energies lies
in the fact that it is highly desirable, for the development of nuclear physics, to 
determine the character and the magnitude of the forces between the neutron and 
the proton at distances corresponding to the very close approach of the two particles. 
Such a determination can be most easily made at present by a study of neutron- 
proton scattering at high energies. Various theoretical papers (Massey &
Buckingham 1937; Rarita & Schwinger 1941) indicate that, with increasing neutron energy,
anomalies in the scattering may be expected to appear rather suddenly in the region 
of 15 MeV. 


2. Previous experiments

Experiments on neutron-proton scattering have been made by a number of 
investigators using the homogeneous group of neutrons which is emitted when 
deuterium is bombarded by deuterons with an energy of the order of 600 keV. The 
energy of the neutrons from such a source is of the order of 2.5 MeV. The best
experiments, for example those of Dee & Gilbert (1937), indicate that with neutrons of
this energy the scattering is isotropic in the centre of mass co-ordinate system 
of the colliding particles. To avoid circumlocution we refer to this as isotropic 
scattering. 

Since the preliminary report on the present work was published, Tatel (1942)
has given an account of experiments on the same subject using a neutron source
produced by the bombardment of beryllium by high-energy deuterons from the
cyclotron. As with many of the light elements, beryllium, when bombarded by
deuterons with an energy of ~8 MeV, emits neutrons with a continuous
distribution in energy. Tatel arranged to observe the recoil protons produced by the
neutrons in the high-energy tail of this continuous distribution, detecting them
with an ionization chamber and linear amplifier. He made observations at two
angles only, 16 and 45°, to the direction of the primary neutrons, and concluded
that the results were in accord with the assumption that at 11 MeV the scattering 
is isotropic. 
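
For reference, the relation between the laboratory angle of a recoil proton and the scattering angle of the neutron in the centre of mass system can be sketched as follows. The sketch assumes non-relativistic elastic scattering of particles of equal mass, for which a proton projected at a laboratory angle φ to the primary neutron corresponds to a neutron scattered through θ = 180° − 2φ in the centre of mass system. The function name is hypothetical.

```python
def neutron_cm_angle_deg(proton_lab_angle_deg):
    """Centre-of-mass scattering angle of the neutron, in degrees.

    Assumes equal neutron and proton masses and non-relativistic
    elastic scattering, so that theta = 180 - 2 * phi.
    """
    if not 0.0 <= proton_lab_angle_deg <= 90.0:
        raise ValueError("recoil protons are projected between 0 and 90 degrees")
    return 180.0 - 2.0 * proton_lab_angle_deg


# The two laboratory angles at which observations are quoted above.
for phi in (16.0, 45.0):
    print(f"recoil at {phi:.0f} deg. -> neutron scattered through "
          f"{neutron_cm_angle_deg(phi):.0f} deg. in the centre of mass system")
```

On this relation a recoil projected forwards (φ = 0) corresponds to θ = 180°, and one at 45° to θ = 90°.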

Finally, in a recent theoretical paper by Hulthen (1943) on this subject, there is a
reference to experiments by Amaldi, Bocciarelli, Feratti & Trabacchi (1942). Let
I(π) and I(½π) be the intensities of the scattered neutrons in the centre of mass
system of co-ordinates of the colliding particles at 180 and 90° respectively. Amaldi
finds the ratio I(π)/I(½π) to be between the values 0.71 and 0.52 for neutrons of
average energy between 12.5 and 14.0 MeV. These values are taken from Hulthen's
paper, as we have not access to Amaldi's publication.


3. The neutron source

It is evident that the most desirable source for experiments of this type is one in 
which the neutrons are homogeneous in energy as is the case with the D + D neutrons 
at a given angle of emission to the direction of the primary beam. No such source is 
obtainable, however, at the present time for the range of energies from 8 to 15 MeV. 
The best conditions of measurement available in practice are provided by a source 
emitting a number of homogeneous groups of neutrons. If the difference in energy 
between the different groups is sufficiently great we can ensure that, over a certain 
range in the angle of scattering, the proton recoils produced by neutrons in the 
different groups can be distinguished because of the difference either in their range 
or in the total ionization which they produce. The distribution in angle of the 
intensity of the projected protons arising from the different groups can then be 
determined in this range of angles. 

Two neutron sources of this character are well known; those produced by the 
bombardment of boron or of fluorine by deuterons with an energy of from 500 to 
1000 keV, the deuterons being accelerated by d.c. high-tension generators. In the 
present work we have preferred to use the neutrons from boron because the high- 
energy neutron groups are more widely separated in energy than is the case with the 
fluorine target. The boron target emits, amongst others, two groups of fast neutrons 
of mean energy 8.8 and 13.0 MeV in a direction at right angles to the incident
deuterons, the spread in energy in each group being less than 1.5 MeV. For the
present work we have used the plates obtained in the experiments previously
described (Powell & Fertel 1939; Powell 1943) in which a boron target was
bombarded by the 600 keV deuteron beam delivered by the Bristol generator.
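
The way in which the two fast groups can be separated by the range of the recoils may be illustrated by a short calculation. It is a sketch only, assuming non-relativistic collisions between particles of equal mass, for which a recoil proton projected at a laboratory angle φ carries an energy E cos²φ. The function names are hypothetical, and the spread in energy within each group is neglected.

```python
import math

GROUP_ENERGIES_MEV = (8.8, 13.0)   # mean energies of the two fast groups


def recoil_energy_mev(neutron_energy_mev, proton_lab_angle_deg):
    """Energy of a recoil proton projected at the given laboratory angle."""
    phi = math.radians(proton_lab_angle_deg)
    return neutron_energy_mev * math.cos(phi) ** 2


def limiting_angle_deg(lower_group_mev, upper_group_mev):
    """Largest angle at which a recoil from the upper group still has more
    energy than any recoil from the lower group can have."""
    return math.degrees(math.acos(math.sqrt(lower_group_mev / upper_group_mev)))


low, high = GROUP_ENERGIES_MEV
for angle in (0, 10, 20, 30, 40):
    print(f"{angle:2d} deg.: {recoil_energy_mev(low, angle):5.2f} MeV (8.8 group), "
          f"{recoil_energy_mev(high, angle):5.2f} MeV (13.0 group)")
print(f"limiting angle: {limiting_angle_deg(low, high):.1f} deg.")
```

When the spread of up to 1.5 MeV in each group is taken into account, the useful range of angles is somewhat narrower than this limiting value suggests.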

Two advantages, for the present experiments, of using a neutron source based on 
a high-tension generator may be emphasized. First, with proper adjustment of the 
voltage distribution across the electrodes of the main accelerating tube, and with 
careful alinement of these electrodes, the deuteron beam can be sharply focused on 
to the target. In the present experiments the area of the target from which the 
majority of the neutrons originate is a circular patch with a diameter of about 4 mm. 



It is therefore possible to secure conditions of good geometry without long
exposures and the attendant dangers of a general background fog due to the X-rays,
y-rays and neutrons coming from various parts of the apparatus. 

Secondly, with relatively slow deuteron bombardment, the number of neutrons in 
the groups of highest energy, corresponding to the formation of the product nuclei 
in the ground or lower excited states, is of the same order of magnitude as that in the 
slower neutron groups which are present. This has the consequence that the long 
recoil tracks in which we are interested do not have to be distinguished against an 
excessively large background of unwanted shorter tracks produced by slower 
neutrons. In contrast with the favourable conditions which are thus provided, 
plates obtained with a source produced by the bombardment of the light elements 
by the high-energy deuteron beams delivered by the cyclotron show very few tracks 
due to neutrons of the highest energy. It will be shown in a subsequent paper that 
the neutrons from such sources show a continuous distribution in energy with a 
maximum intensity in the energy region between 3 and 5 MeV and a very rapid fall 
in going to higher energies. The high-energy neutrons corresponding to the formation 
of product nuclei in the ground state are very rare. An additional disadvantage of 
such a source is that the plates after development show a general background of 
grains, due to the passage through the plate, during exposure, of slow neutrons and 
γ-rays from the cyclotron tank, and this background contributes to the difficulty of 
distinguishing the long tracks. 

These considerations are presumably of some significance for work with counters, 
and they are decisive for experiments employing the photographic method. The 
smaller yield of neutrons from the high-tension generator as compared with the 
cyclotron is of no consequence, since even with the former the exposure times 
necessary are very short. 

4. Determination of the spatial orientation of a track in the emulsion 

The basic problem to be solved in determining the spatial orientation of a track 
in the emulsion is the measurement of the angle of dip of the track, i.e. the angle the 
track makes with the plane defined by the surface of the emulsion. Consideration of 
the restricted depth of focus of high-aperture objectives, 0.3 μ, suggested that it might 
be possible to make such determinations by measuring the depth below the surface 
of the emulsion of different grains in the track. Our microscopes are fitted with 
graduations on the fine adjustment wheel, but the makers informed us that this 
graduation does not provide an accurate method of making depth measurements 
owing to the design of the mechanical system on which the fine motion is based. 
Tests of this opinion using the best modern microscopes have confirmed it. We 
therefore fitted a precision depth gauge in the way illustrated in figure 1. 

The gauge is fixed to the upper arm of the optical bench of the microscope which 
carries the objective and eyepieces. The pin of the gauge, P in figure 1, bears on a 




68 F. C. Champion and C. F. Powell 

ground surface, S, of hardened steel which is attached to the mechanical stage, which 
in turn carries the photographic plate. The gauge therefore indicates any relative 
movement of the focal plane of the objective and the photographic plate. 



Figure 1. Microscope fitted with a depth gauge and vernier scales. (Labels in the figure: vernier scale carried by eyepiece; fine adjustment wheels on mechanical stage; depth gauge.) 


The gauge is graduated in divisions representing 0.0001 in., and it was found 
possible to set a particular grain in focus with a consistency within 0.4 division, i.e. 
to within 0.001 mm., or 1 μ, when using the optical equipment previously described 
(Powell 1943). The actual distance between two grains which appear, when viewed 
through our microscope, to be separated by the full length of the eyepiece scale, is 
0.1 mm. One would therefore expect to be able to determine the angle at which a 
long track 'dips' with respect to the plane of the emulsion with an accuracy of the 
order of ±1°. This estimate takes no account of the fact that even the fastest proton 
tracks suffer small-angle deflexions from their initial direction in passing through the 
emulsion, so that the actual accuracy achieved will be less than this value. 
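The order of magnitude of this estimate can be checked with a short calculation. The sketch below is illustrative only: the function name is hypothetical, and the assumption that the two depth settings are independent, so that their errors add in quadrature, is ours and is not stated in the text.

```python
import math

INCH_MM = 25.4  # millimetres per inch


def dip_angle_uncertainty_deg(setting_error_div=0.4, div_inch=0.0001,
                              baseline_mm=0.1, two_grains=True):
    """Rough uncertainty in the angle of dip of a nearly level track.

    setting_error_div : focusing consistency in gauge divisions (0.4 in the text)
    div_inch          : size of one gauge division in inches (0.0001 in the text)
    baseline_mm       : horizontal separation of the two grains (eyepiece scale, 0.1 mm)
    two_grains        : assumption, combine two independent settings in quadrature
    """
    depth_error_mm = setting_error_div * div_inch * INCH_MM
    if two_grains:
        depth_error_mm *= math.sqrt(2.0)
    return math.degrees(math.atan2(depth_error_mm, baseline_mm))


if __name__ == "__main__":
    print("depth setting error (mm):", 0.4 * 0.0001 * INCH_MM)
    print("dip uncertainty, one setting (deg):", dip_angle_uncertainty_deg(two_grains=False))
    print("dip uncertainty, two settings (deg):", dip_angle_uncertainty_deg())
```

Either way the result lies between about half a degree and one degree, in keeping with the quoted order of ±1°.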

The second measurement necessary in order to define the direction of the track 
relative to the emulsion is the angle which the projection of the track on the plane 
of the plate makes with some standard direction. This is done by setting a line in the 
eyepiece scale to coincide in succession with the separate grains in the track as the 
microscope is focused on them in turn. The setting is made by rotating the eyepiece 




which carries the scale and the fiducial line. This eyepiece is provided with a vernier 
moving over a graduated scale, and by this means the orientation of the eyepiece 
can be read to 0.1° and the setting can be made with an accuracy of 0.25°. 

For making these measurements with the highest accuracy, it is important that 
proper attention should be paid to the adjustments of the substage illumination. If 
the illumination is not properly centred, and if it is not adequate for the high aperture 
of the objective, the full resolving power is not achieved and the accuracy of the 
individual depth measurements is reduced. 

A second point requiring consideration is the fact that the field of highly corrected 
apochromatic objectives is not flat. This has the consequence that when using the 
objectives in conjunction with low-power periplanatic eyepieces, such as those 
normally employed, the grains in a track running across the centre of the field and 
precisely parallel to the surface of the emulsion are not all in focus at the same time. 
In determining the angle of dip of a track from measurements on the depth of two 
chosen grains it must therefore be arranged that the two grains are equidistant from 
the centre of the field. This is a sufficient condition for a reliable observation, since 
the curvature of the field is axially symmetrical. 

In some emulsions, using very small plates with an area of 2-3 cm.², we have found 
that over some areas of the plate the tracks all show a slow continuous change of 
direction with depth. This effect only occurs rarely, and we believe it is due to the 
fact that the emulsion before development is under strain introduced during the 
process of manufacture, as the plate is drying, and to the release of these strains 
during development when the plate is wet. It is therefore desirable to use larger 
plates and to confine measurements to the central areas. Distortion of the gelatine 
is then negligible. 

5. Determination of the energy of the projected protons and the angle of scattering 

Let OP of figure 2 represent a track in the emulsion. OA, which lies in a plane 
OABC, parallel to the surface of the emulsion, is the direction of the neutron stream 
through the point O during the exposure. Q is the projection of P on the plane OABC, 
and R is the projection of Q on the line OA. In measuring a track, the fiducial line 
carried by the rotating eyepiece is first set so that as the successive grains in the 
track are brought into focus they are seen to coincide with the line. In practice the 
ends of the graduations of the eyepiece scale are used as a fiducial line, and the 
beginning of the track is set to coincide with the beginning of the scale. The angle 
through which the eyepiece has been rotated from the standard position in which it 
is parallel to OA can then be determined from the reading of the vernier. This angle, 
which we call the angle of inclination, α, is the angle ROQ in figure 2. The difference 
in depth of the first grain in the track at O and the grain at the point P is determined 
from the readings of the depth gauge and is the length PQ in figure 2. Let PQ = d 
and let the length OQ, measured directly by the eyepiece scale, be l. The angle of 




dip of the track, β, is then equal to tan⁻¹(d/l).* It is required to determine the angle 
ROP which the track makes with the original direction of the neutron stream 
through O, together with the length L of the track. If ROP = θ, then 

$$L = l \sec\beta, \qquad (1)$$

$$\cos\theta = \cos\alpha \cos\beta. \qquad (2)$$



Figure 2. Definition of the angular characteristics of a track. 

Knowing the stopping power of the emulsion and the range-energy relationship, 
the length L of the track enables one to determine the energy, E, of the proton 
producing it. The energy, E₀, of the neutron giving rise to the recoil proton is then 
given by the equation E₀ = E sec² θ. It is thus possible to deduce the energy distribution 
of the original neutron beam from observations of all proton tracks and not 
merely from those projected forward within a small angle of the direction of the beam. 
The accuracy of the determination of E₀ must be expected to decrease rapidly with 
increasing values of θ according to the relation 

$$\delta E_0 = 2 E_0 \tan\theta \, \delta\theta.$$
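The reduction of a single track from the measured quantities l, d and α can be sketched as follows. This is a minimal illustration of equations (1) and (2) and of the relation E₀ = E sec² θ; the function names are hypothetical, and the conversion of track length to proton energy through the range-energy relation is deliberately left out because that relation is not reproduced here.

```python
import math


def track_geometry(l, d, alpha_deg):
    """Return (beta_deg, L, theta_deg) for a track.

    l         : projected length OQ on the plane of the plate
    d         : depth difference PQ (same units as l)
    alpha_deg : angle of inclination ROQ in degrees
    """
    beta = math.atan2(d, l)                      # angle of dip, tan(beta) = d / l
    L = l / math.cos(beta)                       # equation (1): L = l sec(beta)
    cos_theta = math.cos(math.radians(alpha_deg)) * math.cos(beta)  # equation (2)
    theta = math.acos(cos_theta)
    return math.degrees(beta), L, math.degrees(theta)


def neutron_energy(E_proton, theta_deg):
    """E0 = E sec^2(theta) for a recoil proton of energy E at angle theta."""
    return E_proton / math.cos(math.radians(theta_deg)) ** 2


def neutron_energy_error(E0, theta_deg, dtheta_deg):
    """dE0 = 2 E0 tan(theta) dtheta, with dtheta supplied in degrees."""
    return 2.0 * E0 * math.tan(math.radians(theta_deg)) * math.radians(dtheta_deg)


if __name__ == "__main__":
    beta, L, theta = track_geometry(100.0, 100.0 * math.tan(math.radians(5.0)), 20.0)
    print(beta, L, theta)
    print(neutron_energy(6.5, 45.0), neutron_energy_error(13.0, 45.0, 3.0))
```

The last line shows how quickly the error grows with angle: for a fixed uncertainty in θ, the uncertainty in E₀ is proportional to tan θ.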

Since it is required to measure the relative probability of scattering in different 
directions, it is necessary to ensure that all tracks within closely defined limits are 
measured, and checks against any tendency of an observer to choose tracks with 
particular characteristics must be applied. For this purpose maps are constructed 
of those areas of the plate on which measurements are made. In addition to the 
angle of inclination, these maps show the co-ordinates of the beginning and end of 
each track as deduced from the readings of the verniers on the x and y movements 
of the mechanical stage. For accurate mapping it is necessary to determine the x 

* In practice the angle β is determined from the difference in depth of two grains, near 
the beginning of the track and separated by the length of the eyepiece scale (0.1 mm.). 











and y co-ordinates of a point in the plate with greater precision than is allowed with 
scales and verniers normally supplied with the mechanical stage, which read to 
0.1 mm. The milled knobs controlling the movements of the stage were therefore fitted 
with graduated wheels with suitable vernier scales as shown in figure 1. The maps 
allow the observer to re-examine a given plate for any tracks that he may have overlooked 
and allow one to compare the efficiencies of different observers in this respect. 
In practice it is found that with experienced observers the number of tracks missed 
is very small, and the differences between the measurements of different observers 
on the same track are trivial. 


6. Energy distribution of the neutrons 

An area of 200 mm.² has been examined by the methods described in the previous 
paragraph, and 4000 tracks contained in it, which were produced by neutrons with an 
energy greater than 3.5 MeV, have been mapped. Tracks produced by neutrons of 
lower energy were ignored. This total area was made up of two nearly equal parts 
taken from two separate plates which were similarly disposed during exposure with 
respect to the axis of the primary deuteron beam. The two sets of measurements, 
A and B, were treated separately in the following way. In set A, the calculations 
were carried through for all tracks produced by neutrons with an original energy 
greater than 3.5 MeV. In set B, the calculations were restricted to those tracks 
which correspond to an original neutron energy greater than 7 MeV. From the 
measurements on the tracks from the two sets which have values of θ less than 10°, 
together with the measurements on similar plates taken with the boron target 
described in the previous paper, it is possible to make an estimate of the homogeneity 
of the neutrons in the different groups forming the neutron spectrum emitted from 
the target, and the results are shown in figure 3. 

It will be seen from this figure that the width at half-maximum of the 13 MeV 
group, which corresponds to the formation of C¹² in the ground state, is 0.3 MeV, 
a value which is in good agreement with the results obtained in a previous paper 
(Powell 1943), where it was shown that the experimental width at half-maximum 
of homogeneous groups of protons was nearly constant over the energy range from 
3 to 10 MeV and equal to 0.25 MeV. 

In contrast with this result the width of the 8.8 MeV group is 0.8 MeV, a value 
more than three times as great as that obtained with the protons of nearly the same 
energy, coming from the reaction B¹⁰(d, p)B¹¹ (Powell 1943). It is therefore clear 
that the neutrons in this group are not homogeneous in energy. This spread in energy 
cannot be due to a short lifetime in the excited state of C¹² at 4.6 MeV, since this 
state is not of sufficient energy to lead to the disintegration of C¹² into three α-particles. 
The most reasonable explanation would therefore seem to be that the 
state is a multiplet. The form of the neutron distribution is compatible with the 
state being a doublet with a separation of about 0.5 MeV. The mean energy of the 
excited state as deduced from the present experiments, 4.5 MeV, is rather higher 




than that obtained by other observers. Thus Cockcroft & Lewis (1936) obtained the 
value 4.2 MeV from a study of α-particles emitted from the reaction N¹⁴(d, α)C¹², 
whilst Bonner & Brubaker (1936) give the value 4.4 from an examination of the 
neutrons from the reaction used in the present work. The Q value obtained in the 
present work for the production of C¹² in the ground state, 13.6, is in good agreement 
with that expected from the known masses of the nuclei involved, viz. 13.68. For 
the purposes of the present experiments it is sufficient to note that the width of 
peaks at half-maximum is less than 0.9 MeV, and the full width less than 1.5 MeV. 



Figure 3. Distribution in energy of the groups of fast neutrons. (Abscissa: energy in MeV.) 


7. Accuracy of the angular measurements 

It has already been seen that the estimated error in the determination of the 
angle of inclination of a straight track is of the order of 0.25° and in the angle of 
dip, 1°. It is shown later that it is desirable to employ only those tracks which make 
angles of dip less than 10°. Since the actual angle of scattering, θ, is given by the 
relation cos θ = cos α cos β, it follows that for most of the tracks the greater part 
of the error in the determination of θ will come from errors in the determination of α, 
so that the errors in the determination of the orientation of a track can be expected 
to be less than ± 1°. Thus 

$$\cos\theta = \cos\alpha \cos\beta,$$

and hence 

$$\delta\theta = (\cos\alpha \sin\beta \, \delta\beta + \cos\beta \sin\alpha \, \delta\alpha)\,\operatorname{cosec}\theta.$$

Putting α = 20°, β = 5°, δα = ±0.25° and δβ = ±1°, then θ = 20° 36' and δθ = 0.47°. 
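The numerical example above can be reproduced directly from the error formula. The following is a minimal sketch with a hypothetical function name; because the formula is linear in the small errors, these may be supplied in degrees and the result is then also in degrees.

```python
import math


def scattering_angle_error(alpha_deg, beta_deg, dalpha_deg, dbeta_deg):
    """Return (theta_deg, dtheta_deg) from cos(theta) = cos(alpha) cos(beta).

    dtheta = (cos a sin b db + cos b sin a da) / sin(theta)
    """
    a = math.radians(alpha_deg)
    b = math.radians(beta_deg)
    theta = math.acos(math.cos(a) * math.cos(b))
    dtheta_deg = (math.cos(a) * math.sin(b) * dbeta_deg
                  + math.cos(b) * math.sin(a) * dalpha_deg) / math.sin(theta)
    return math.degrees(theta), dtheta_deg


if __name__ == "__main__":
    theta, dtheta = scattering_angle_error(20.0, 5.0, 0.25, 1.0)
    print(theta, dtheta)
```

For the values used in the text this gives θ close to 20.6° (20° 36') and δθ close to 0.47°.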





It is a satisfactory feature of the method that with the restriction |β| < 10°, the 
errors in θ due to errors in β are greatest where they are of least consequence. If the 
above value for δθ is substituted in the relation δE₀/E₀ = 2 tan θ δθ, then 

$$\delta E_0 / E_0 = 2(\tan\alpha \, \delta\alpha + \tan\beta \, \delta\beta).$$

Substituting the maximum errors δβ = ± 1° and δα = ± 0.25°, then it is found that 
the maximum value of δE₀/E₀ varies from 0.6 % at small angles to 1.4 % at 45°, 
errors much smaller than the spread in a peak due to straggling. Unfortunately, it 
is not possible to display the full potentialities of the method in the present experiments 
on account of the uncertainties in the precise direction of the neutron stream 
relative to the plates during the exposure, for these uncertainties lead to larger 
errors in the determination of θ than those involved in the determination of the 
precise orientation of a track in the emulsion. 
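The quoted limits on δE₀/E₀ can be checked numerically. In the sketch below the function name is hypothetical, and taking β at its limiting value of 10° to represent the "maximum" case is our reading of the text rather than something it states explicitly.

```python
import math


def relative_energy_error(alpha_deg, beta_deg, dalpha_deg=0.25, dbeta_deg=1.0):
    """dE0/E0 = 2 (tan(alpha) dalpha + tan(beta) dbeta), errors given in degrees."""
    a = math.radians(alpha_deg)
    b = math.radians(beta_deg)
    return 2.0 * (math.tan(a) * math.radians(dalpha_deg)
                  + math.tan(b) * math.radians(dbeta_deg))


if __name__ == "__main__":
    for alpha in (0.0, 15.0, 30.0, 45.0):
        # beta = 10 degrees is the largest dip accepted in the analysis
        print(alpha, 100.0 * relative_energy_error(alpha, 10.0), "%")
```

With these inputs the relative error rises from roughly 0.6 % at small angles to roughly 1.5 % near 45°, of the same size as the figures quoted above.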

The uncertainty in the direction of the neutron flux arises from two causes. First, 
the region of the target bombarded by the neutron stream is estimated to have had a 
diameter of 4 mm. Since the plate was at a mean distance of 62 mm. from the source 
there is an uncertainty in the direction of the neutron stream at any point in the 
plate of ± 2°. Secondly, no attempt was made during the exposure of the plates to 
define the orientation of the plate with respect to a line drawn from the centre of 
the plate to the centre of the source. The original exposures were made for the 
purpose of deducing the energy distribution in the neutrons from the source by the 
methods described in Part I, and for this purpose an error of 2° in the setting of 
the plates is of no consequence. We estimate the errors in the measurement of 6 
arising from these deficiencies in the experimental arrangement to be ± 3°. 
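The first of these contributions follows from simple geometry and can be sketched as below. The function name is hypothetical; the quadrature combination of the two 2° contributions is an assumption made here only to show that it is of the same size as the ± 3° estimated in the text.

```python
import math


def source_half_angle_deg(source_diameter_mm, distance_mm):
    """Half-angle subtended at the plate by a circular source of the given diameter."""
    return math.degrees(math.atan2(source_diameter_mm / 2.0, distance_mm))


if __name__ == "__main__":
    near = source_half_angle_deg(4.0, 62.0)    # geometry of the original exposures
    far = source_half_angle_deg(4.0, 200.0)    # plates moved back to 20 cm
    print(near, far)
    # Assumption: source size and plate setting errors are independent.
    print(math.hypot(2.0, 2.0))
```

At 62 mm the half-angle is a little under 2°, and at 20 cm it falls to a little over half a degree.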



Figure 4. Variation in direction of primary neutron flux through the plate. 


As a consequence of this source of error in the determination of 6 we have not 
considered it worth while to make any allowance for the variation in the direction 
of the incident neutron stream in passing across the plate. The variation of this 
direction over the measured area is shown in figure 4, which is constructed on the 
assumption that the line joining the centre of the plate to the centre of the source 
runs parallel to the long edge of the plate. It remains to show that the uncertainties 
in the determination of 6 are of no serious importance for the present experiments. 



[Figure axis label: eyepiece scale divisions (1 division = 1.46 cm. standard air).] 







8. Tests of the accuracy of the measurements 

In order to display the experimental results and to show that the accuracy in the 
determination of θ is within ± 3°, we show, in figure 5, the range of all measured 
tracks, of length greater than 12 cm. air equivalent, plotted against the corresponding 
angle of scattering, θ. 

If there were no straggling of the proton tracks in the emulsion, and if the determinations 
of θ were made with perfect precision, the points representing the results, 
when plotted in the manner shown in figure 5, would be expected to lie on one of 
four lines, provided the four neutron groups were homogeneous in energy. The line for 
each group would be defined, (a) by the fact that the energy E of a proton, projected 
at an angle θ, is related to the energy, E₀, of the neutron producing it by the equation 
E = E₀ cos² θ and, (b) by the range-energy relation for the emulsion. In figure 6, AB 
represents a portion of such a line. Owing to straggling, and to the inhomogeneity 
in the different neutron groups, it must be expected that in practice the lines will be 
broadened into bands with a total width corresponding, at an angle θ, to a variation 
of ± 0.5 MeV in the proton energy. In figure 6, a is a point on AB corresponding to 
a particular value of θ, and b and c are plotted for proton energies 0.5 MeV above and 
0.5 MeV below the value corresponding to the point a. If the errors due to straggling 
were alone present, then most of the experimental points would be expected to lie 
in a band between two curves defined as the loci of b and c as a moves along AB. 



Figure 6. Effect of straggling and errors in the angular determinations on 
the range distribution at different angles of scattering. 

A further spread in the experimental observations must be expected as a consequence 
of the uncertainty in the determination of θ. To take account of this the 
points d and e are plotted in figure 6 so that the lines dc and be have lengths corresponding 
to 3°. If the errors in the determination of θ are not greater than ± 3°, then 
the experimental results, corresponding to any one of the neutron groups, would 




be expected to lie within a band defined by the locus of d and e as a moves along 
the curve AB. 
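The construction of such a band can be illustrated in terms of proton energy rather than range. This is a simplified sketch, not the construction used for figure 5: the function names are hypothetical, and working in energy avoids the range-energy relation of the emulsion, which is not reproduced here (since range increases steadily with energy, the ordering of the band edges is unaffected).

```python
import math


def proton_energy(E0, theta_deg):
    """E = E0 cos^2(theta) for a recoil proton at laboratory angle theta."""
    return E0 * math.cos(math.radians(theta_deg)) ** 2


def band_limits(E0, theta_deg, dE=0.5, dtheta_deg=3.0):
    """Lower and upper proton energies expected at a plotted angle theta.

    dE         : half-width in MeV allowed for straggling and inhomogeneity
    dtheta_deg : uncertainty in the angle of scattering
    """
    small_angle = max(theta_deg - dtheta_deg, 0.0)   # true angle may be smaller
    large_angle = theta_deg + dtheta_deg             # or larger
    upper = proton_energy(E0, small_angle) + dE
    lower = proton_energy(E0, large_angle) - dE
    return lower, upper


if __name__ == "__main__":
    for theta in (0.0, 10.0, 20.0, 30.0):
        print(theta, band_limits(8.8, theta), band_limits(13.0, theta))
```

Two groups can be told apart at a given angle so long as the lower edge of the band of higher energy lies above the upper edge of the band of lower energy.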

In figure 5 we include dotted lines giving the extent of the bands corresponding 
to each neutron group as defined in this way. It is clear that the great majority of 
the points lie within the bands, and that the bands corresponding to the three groups 
of highest energy do not seriously overlap, in the range from 0 to 45°. There is slight 
overlapping of the bands due to the 6 and 9 MeV groups between 40 and 45°, but the 
number of tracks common to both bands is small. It may therefore be concluded 
that in general the tracks which have been produced by neutrons from the 9 and 
from the 13 MeV groups can be distinguished without ambiguity. 



Figure 7. Distribution in energy of the primary neutrons as deduced from 
measurements on tracks in different intervals in the angle of scattering 


The results can also be represented by plotting the distribution of the 
measurements of all tracks in the ranges of angles 0-10°, 10-20°, 20-30°, 30-40° 




and 40-45°. In this case a steady increase in the errors in the determination of E₀ 
must be expected with increasing values of θ, according to the relation 

$$\delta E_0 = 2 E_0 \tan\theta \, \delta\theta,$$

and the increasing spread in energy distribution corresponding to a given group is 
in satisfactory agreement with that to be expected. 

Beyond 45° the overlapping of the bands is too serious to allow precise conclusions 
to be drawn about the origin of the tracks. It seems reasonable to assume that it 
would be possible to reduce the errors in the values of θ to ± 1°, if proper care were 
taken to define the position of the plates at a sufficient distance from the neutron 



Figure 8. Improvement in resolution of range distribution of high-energy 
groups to be expected from improved geometry. 

source. Figure 8 shows the bands for the three groups of highest energy calculated 
for an error of ± 1° in the value of θ. It will be seen that the bands for the 9 and 
13 MeV groups remain separated up to 60° in this case. Our original plates were 
exposed in the middle of 1939, and further exposures under more satisfactory 
geometrical conditions should have been made, but with the outbreak of war the 
high-tension set was temporarily dismantled. In the original exposures the plates 
were placed close to the source in order to reduce the ratio of the neutron flux through 
the plates due to neutrons scattered from the walls, etc., to that from the source. 
Later experience has shown that this precaution was unnecessary. It would be an 
easy matter to place the plates 20 cm. from the source without seriously increasing 
the necessary exposure time. With the experimental arrangement described in 
Part I, this would reduce the uncertainty in the value of θ, due to the finite size of 
the source, to ± ½°. 








9. Escape of tracks from the emulsion and corrections for loss 

In analysing the results a complication is introduced by the fact that the emulsion 
is of limited thickness, so that some tracks which start in the emulsion appear 
to end at the surface or at the glass. In general in such cases, the proton producing 
the track has left the emulsion before coming to rest, and therefore the length of 
the track does not give a measure of the proton energy. Such tracks are rejected 
since it is not possible to decide to which group the neutron responsible for the 
track belongs, and allowance must be made for this loss in calculating the scattered 
intensity at any angle from the observations. A method of treating this problem has 
been described by Richards ( 1941 ) for the case of an experiment in which the 
tracks accepted for measurement are those for which the angle of scattering is less 
than 12 °. 

Let D be the thickness of the emulsion. Ignore straggling and assume that a 
proton of energy E produces a track of length L. Let β be the angle of dip of a track 
of length L which starts in the surface of the emulsion, and which therefore ends at 
depth L sin β, i.e. at a distance D − L sin β from the glass of the plate. It follows that 
of all the tracks produced by protons with characteristics E, β, which start in the 
emulsion, only those will end in the emulsion and be accepted for measurement 
which start within a depth D − L sin β from the surface of the emulsion. All other 
tracks of this class will pass out of the emulsion into the glass and will be neglected. 
It follows that only the fraction f = 1 − L sin β/D of tracks produced by protons of 
this class will be measured. This calculation takes no account of the scattering of 
tracks through small angles, but the effect of scattering is small, since the probability 
of a proton being scattered so that it remains in the emulsion when, but for scattering, 
it would have passed out, is nearly exactly balanced by the reverse effect 
of particles being scattered out of the emulsion which would have otherwise 
remained in. 
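The fraction f is simple to evaluate for any track length and angle of dip. The sketch below uses a hypothetical function name; setting f to zero when L sin β exceeds D is our reading of the case in which no track of the class can end in the emulsion.

```python
import math


def fraction_retained(L, beta_deg, D):
    """f = 1 - L sin(beta) / D, the fraction of tracks ending inside the emulsion.

    L and D must be in the same units. The sign of beta is ignored, since a track
    dipping towards either face is lost in the same way.
    """
    f = 1.0 - L * math.sin(math.radians(abs(beta_deg))) / D
    return max(f, 0.0)  # no track of this class survives once L sin(beta) >= D


if __name__ == "__main__":
    # Illustrative numbers only (not taken from the paper): a track twice as long
    # as the emulsion is thick, at several angles of dip.
    for beta in (0.0, 5.0, 10.0, 20.0, 30.0):
        print(beta, fraction_retained(200.0, beta, 100.0))
```

Long tracks are thus accepted only at small angles of dip.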

To examine the effects of the escape of protons from the emulsion we now represent 
the experimental results in a new way. We assemble together the measurements for 
all tracks which are identified as being produced by one of the two neutron groups 
of highest energy. As a simple criterion we attribute all tracks for which the calculated 
value of E₀ lies between 7 and 11 MeV to the 9 MeV group; and between 11 
and 15 MeV to the 13 MeV group. We represent the results for all the tracks from 
one group in a diagram in which we plot for each track the quantity sin β as ordinate 
against cos β sin α as abscissa. In such a diagram reference to figure 2 shows that 
lines of constant angle of scattering, θ, are circles round the origin, and the figure is 
essentially a polar diagram representing the distribution in angle of the scattered 
particles. The results for the 9 MeV group are represented in figure 9. 
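That lines of constant θ are circles follows from the identity (cos β sin α)² + sin² β = 1 − cos² α cos² β = sin² θ, so that each point lies at a distance sin θ from the origin. A short numerical check, with hypothetical function names, is sketched below.

```python
import math


def diagram_point(alpha_deg, beta_deg):
    """Co-ordinates (abscissa, ordinate) = (cos(beta) sin(alpha), sin(beta))."""
    a = math.radians(alpha_deg)
    b = math.radians(beta_deg)
    return math.cos(b) * math.sin(a), math.sin(b)


def scattering_angle_deg(alpha_deg, beta_deg):
    """theta from cos(theta) = cos(alpha) cos(beta)."""
    a = math.radians(alpha_deg)
    b = math.radians(beta_deg)
    return math.degrees(math.acos(math.cos(a) * math.cos(b)))


if __name__ == "__main__":
    for alpha, beta in ((20.0, 5.0), (35.0, -8.0), (10.0, 10.0)):
        x, y = diagram_point(alpha, beta)
        theta = scattering_angle_deg(alpha, beta)
        # The two printed numbers agree: the radius of the point is sin(theta).
        print(math.hypot(x, y), math.sin(math.radians(theta)))
```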

If there were no loss of tracks at the surfaces of the emulsion and if there are no 
polarization effects associated with the neutron stream passing through the plates, 
one would expect the points representing the observations to be distributed with 
circular symmetry about the pole of the diagram, within the limits of statistical 





error. It will be seen that in fact there is a marked lack of circular symmetry, the 
tracks being confined to a region close to the X axis corresponding to the smaller 
values of β, an effect clearly due to the escape of tracks with the larger angles of dip. 

It has been shown that the fraction, f, of tracks of the class (E, β) which remain 
in the emulsion, is given by 

$$f = 1 - L \sin\beta / D.$$

Suppose the neutron group to be homogeneous in energy with the value E₀. The energy 
E of protons projected at an angle θ is then equal to E₀ cos² θ, and their range L can 
be determined. Corresponding to this value of θ, the successive values of β for which 
f has the values 0, 0.1, 0.2, 0.3, etc. can be determined from the equation above. 
Making such calculations for different values of θ, lines of constant f can be constructed, 
and such lines have been included in figure 9. One quadrant of the 
corresponding figure for the 13.0 MeV group is shown in figure 10. 



Figure 10. Lines of constant f and θ for 13 MeV group. 


In figure 9 all but 3 % of the points lie within the region defined by the lines f = 0. 
Of the twenty tracks which lie outside this region, ten would fall within it if the 
observed values of β were changed by 1°. The agreement between the calculations 
and the observations is therefore very satisfactory in this respect, since the effects 
of scattering, of straggling, and of any inhomogeneity in the neutron energy in the 
group have been ignored. 

Now consider the region of the diagram confined between two lines of constant 
angle of scattering, θ and θ + δθ. In the absence of the escape of protons the points 
representing the tracks would be expected to be uniformly scattered over the region, 






the number in any area δA being proportional to δA within the limits of statistical 
error. Owing to 'escape' the probability will be changed to f̄ δA, where f̄ is the mean 
value of f taken over the area δA. The actual distributions for intervals in θ from 
10 to 20°, 20 to 30°, etc. have been examined. For each interval, the areas defined 
between the lines f = 0 to f = 0.2, f = 0.2 to f = 0.4, etc. are measured, from a 
large-scale drawing, by means of a planimeter. Consider a region of the diagram 
confined between the lines θ = θ₁, θ₂. If, in this region, ν is the number of points in 
a small area δA, then ν/δA should be proportional to the mean value of f for the area 
chosen whatever its location in the region. There is consistency between the observations 
and the expected distribution to within the limits of the statistical error, 
and it can therefore be assumed that there is no serious loss in the efficiency of the 
observers in distinguishing tracks in the range in β involved in the measurements, 
i.e. from β = 0° to β = 15°. 

A similar analysis of the results for the higher angles of scattering shows that for 
angles of dip greater than 30° there is an increasing tendency for the observers to 
fail to distinguish tracks. In analysing the results to determine the variation in the 
intensity of scattering with angle only those tracks have been taken for which 
β < 10°. The proportion of measured tracks thus rejected is only 10 % because of 
the small values of f for the larger values of β, and by imposing this limitation it 
can be assured that no errors due to the failure of the observer to distinguish tracks 
are introduced into the results. 

10. Determination of the scattered intensity 

The analysis described in the previous paragraph enables one to calculate the 
fraction, F, of all the tracks starting in a given area of the plate, and produced by 
a given neutron group, which will be accepted for measurement. Consider the range 
of angles from θ₁ to θ₂ in the appropriate diagram of the type shown in figures 9 
and 10, and let the mean number of tracks per unit area of this diagram, which would 
be observed in the absence of loss, be ν_θ. The total number of tracks in this range in 
the angle of scattering is then ν_θ π(sin² θ₂ − sin² θ₁). Since those tracks for which 
|β| > 10° are rejected, the number of tracks actually measured and accepted will 
equal 4ν_θ ∫ f dA, where the integration is carried out in one quadrant of the diagram 
only, from θ = θ₁ to θ = θ₂ and from β = 0 to β = 10°. The value of F for this 
range in the angle of scattering, F_θ, is then given by 

$$F_\theta = \frac{4 \int f \, dA}{\pi(\sin^2\theta_2 - \sin^2\theta_1)}.$$

The values of F_θ have been calculated for the 9 and 13 MeV groups in the range 
from θ = 0° to θ = 45° by means of a planimeter and a large-scale drawing as described 
in the previous paragraph, taking intervals of 0.1 in the value of f, and of 2.5° in 
the value of θ. The values of F_θ thus obtained are shown for the two neutron groups 
of highest energy in figure 11. 
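The integral for F_θ was evaluated with a planimeter; a numerical quadrature over the same region of the diagram gives an equivalent estimate. The sketch below is an illustration under stated assumptions and not the authors' procedure: the function names are hypothetical, and `hypothetical_range` is a made-up power law standing in for the range-energy relation of the emulsion, which is not reproduced here.

```python
import math


def hypothetical_range(theta_deg, full_range=400.0):
    """Illustrative stand-in for L(theta); NOT the emulsion's range-energy relation.

    Assumes range proportional to E**1.8 with E = E0 cos^2(theta).
    """
    return full_range * math.cos(math.radians(theta_deg)) ** 3.6


def mean_acceptance(theta1_deg, theta2_deg, range_of_theta, D,
                    beta_max_deg=10.0, n_r=200, n_psi=400):
    """Estimate F = 4 * integral(f dA) / (pi (sin^2 theta2 - sin^2 theta1)).

    The diagram is described in polar co-ordinates: radius r = sin(theta) and
    ordinate y = r sin(psi) = sin(beta). One quadrant is integrated by the
    midpoint rule and multiplied by four.
    """
    r1 = math.sin(math.radians(theta1_deg))
    r2 = math.sin(math.radians(theta2_deg))
    y_max = math.sin(math.radians(beta_max_deg))
    dr = (r2 - r1) / n_r
    dpsi = (math.pi / 2.0) / n_psi
    total = 0.0
    for i in range(n_r):
        r = r1 + (i + 0.5) * dr
        L = range_of_theta(math.degrees(math.asin(r)))
        for j in range(n_psi):
            y = r * math.sin((j + 0.5) * dpsi)
            if y > y_max:
                continue                      # dip exceeds the cut-off: rejected
            f = max(0.0, 1.0 - L * y / D)     # fraction ending in the emulsion
            total += f * r * dr * dpsi        # area element dA = r dr dpsi
    return 4.0 * total / (math.pi * (r2 ** 2 - r1 ** 2))


if __name__ == "__main__":
    # D = 100 in the same (arbitrary) units as the range; illustrative only.
    for t1 in (10.0, 20.0, 30.0, 40.0):
        print(t1, t1 + 2.5, mean_acceptance(t1, t1 + 2.5, hypothetical_range, 100.0))
```

As a check on the quadrature, setting the range to zero and removing the cut-off in β returns F = 1, as it should when no tracks are lost or rejected.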



Let n_θ δθ be the number of measured tracks in the range from θ to θ + δθ, β < 10°. 
The total number of tracks of this type starting in the emulsion in the area examined 
is then given, within the limits of statistical error, by 

$$N_\theta \, \delta\theta = n_\theta \, \delta\theta / F_\theta.$$

If I_θ represents the intensity of the scattered protons per unit solid angle, then 
I_θ 2π sin θ δθ = N_θ δθ and I_θ = kN_θ cosec θ, where k is a constant. 



Figure 11. Variation of F_θ and F_θ sin 2θ with θ for 8.8 and 13.0 MeV 
groups; F_θ sin 2θ in arbitrary units. 

Since the proton and the neutron are of nearly equal mass, a proton projected at
an angle $\theta$ in the laboratory co-ordinate system has an angle of projection $\phi = 2\theta$ in
the co-ordinate system of the centre of mass of the colliding particles. If $I_\phi$ is the
intensity of scattering per unit solid angle in the centre of mass system, then

$$I_\phi \sin\phi\,\delta\phi = I_\theta \sin\theta\,\delta\theta; \qquad I_\phi = I_\theta\,\frac{1}{4\cos\theta} = k_1 N_\theta \operatorname{cosec} 2\theta.$$

It follows that if the scattering is isotropic in the centre of mass system, corresponding
to a constant value for $I_\phi$ over the whole range of angles, then $n_\theta/F_\theta \sin 2\theta$
should also be constant, i.e. $n_\theta$ proportional to $F_\theta \sin 2\theta$. The investigated range from
0 to 45° in the angle of the projected protons corresponds to a range from 0 to 90°
in the centre of mass system. The intensity of the projected protons in the centre of
mass system at angle $\phi$ is equal to the intensity of the neutrons scattered through
$\pi - \phi$ in this system. Our results therefore give the variation of the scattered
intensity for the neutrons in the backward direction, i.e. from 90 to 180° in the centre
of mass system.
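The conversion from observed track numbers to the relative intensity in the centre of mass system can be sketched as follows. This is an illustration only: the function name, the angles and the values taken for the factor F at each angle are hypothetical, and the counts are constructed to be proportional to F sin 2θ so that the isotropic case is reproduced.

```python
import math

def cm_intensity(n_theta, f_theta, theta_deg):
    """Relative centre-of-mass intensity at phi = 2*theta, in arbitrary units.

    Uses N_theta = n_theta / F_theta and I_phi proportional to
    N_theta * cosec(2*theta), as in the relations above.
    """
    theta = math.radians(theta_deg)
    total_tracks = n_theta / f_theta          # correct for the geometrical factor
    return total_tracks / math.sin(2.0 * theta)

# Hypothetical values of F at each laboratory angle (not taken from figure 11).
angles_deg = [5.0, 15.0, 25.0, 35.0, 44.0]
f_values = [0.90, 0.70, 0.50, 0.40, 0.35]

# Counts built to follow n proportional to F * sin(2*theta), i.e. isotropic scattering.
counts = [100.0 * f * math.sin(math.radians(2.0 * a))
          for a, f in zip(angles_deg, f_values)]

intensities = [cm_intensity(n, f, a)
               for n, f, a in zip(counts, f_values, angles_deg)]
```

With counts of this form every entry of `intensities` comes out the same, which is the statement that isotropic scattering gives a constant intensity in the centre of mass system.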





Applications of photographic method to problems in nuclear physics

11. Results for the 8.8 MeV group of neutrons

The results obtained for the 8.8 MeV neutron group are represented in figure 12,
in which are plotted the observed numbers of tracks in successive intervals of 1° in
the angle of scattering in the range from 0 to 45°. In this figure the full line represents
the distribution to be expected if the scattering is isotropic. The form of this line is
given by the relation $n_\theta = CF_\theta \sin 2\theta$, where the constant $C$ is chosen to give a
distribution corresponding to the number of tracks, 881, which fulfil the angular
restrictions $\beta < 10°$, $\theta < 45°$, and which are identified as being produced by the
8.8 MeV group. It is a satisfactory feature of the method that for this energy isotropic
scattering would produce an almost uniform distribution of tracks in the range from
0 to 45° in the angle of scattering.

It will be seen from figure 12 that the general trend of the results is in accord with
the assumption of isotropic scattering. This result is displayed in table 1, where we
compare the expected number of tracks in successive angular intervals of 9° with the
number observed.

Figure 12. 8.8 MeV group. Observed distribution of projected protons in successive intervals
of 1° in the angle of scattering. The full line a shows the distribution expected if scattering
is isotropic. Dotted line b shows estimated best smooth curve.

Table 1

angular interval    expected number    observed
in degrees          of tracks          number
 0-9                134                173½
 9-18               171                173½
18-27               181                159
27-36               195                188
36-45               200                186½

An alternative method of representation is shown in figure 13, where we plot, for 
successive intervals of 2° in the angle of scattering, the ratio of the observed number 
of tracks to that expected. In this diagram isotropic scattering would be represented 
by the horizontal line corresponding to unity in the ratio of the observed to the 
calculated intensity. The full line in this figure is the estimated best smooth curve 
through the experimental points and has a form roughly similar to that expected if 
there were a small $p$-component in the scattered intensity. The dotted curve b in
figure 12 is then drawn to correspond with the full line in figure 13, and also corre¬ 
sponds to a total of 881 tracks. 


[Plot: ratio of observed to expected number of tracks against angle of scattering, 0 to 45°.]

Figure 13. 8.8 MeV group. Ratio of observed number of tracks in successive intervals
of 2° to that expected if scattering is isotropic. Dotted line shows best smooth curve.


An inspection of figure 12 shows that there is some indication of a periodic variation
of high order in the intensity, especially in the angular range from 0 to 30°.
We display these fluctuations in figure 14, where we plot for every integral value of
the angle of scattering the number of tracks in angular intervals of 3°. It will be
seen that there are maxima at 6, 18 and 30°, and minima at 12 and 24° in this curve.
Whilst the existence of variations of this character in the scattered intensity seemed
very unlikely, we have calculated the probability that the observed departures
from the smooth curve shall be as great as those actually found if they are due to
statistical fluctuations alone. If $m$ is the expected value for the intensity at any
angle and $m + x$ the observed value, then the quantity $\chi^2 = \sum x^2/m$ can be calculated
for the observations. Then the level of significance of the results can be derived from
the values of $\chi^2$, calculated from a normal distribution, given by Fisher & Yates
(1938). Taking the number of tracks in intervals of 1°, for the range from 1 to 25°,
and taking the expected distribution to be that corresponding to the full curve in
figure 13, it is found that $\chi^2 = 29.2$. Since the dotted curve corresponds to the total
number of tracks actually observed, the number of independent groups is 24 - 1 = 23.
The table for $\chi^2$ then indicates that the probability that the errors will be equal to
or greater than those actually observed is 7 %. If a similar calculation is made
grouping the tracks in successive angular intervals of 2°, instead of 1°, an almost
equal level of significance is found. The level of significance is usually fixed
arbitrarily by statisticians at 5 %, so they would not regard the departure from a
smooth distribution as definitely established, but physicists commonly employ a
very much less stringent test of significance.
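The quantity used in this test can be written out in a few lines. The sketch below only evaluates the sum of x²/m over the groups and the number of independent groups when the expected curve is normalized to the observed total; the counts are hypothetical and are not the measured tracks, and the probability itself would still be read from a table of χ² such as that of Fisher & Yates.

```python
def chi_square(observed, expected):
    """Sum of x^2 / m over all groups, with x = observed - expected and m = expected."""
    return sum((o - m) ** 2 / m for o, m in zip(observed, expected))

def independent_groups(number_of_groups):
    """One group is lost because the expected total is fixed to the observed total."""
    return number_of_groups - 1

# Hypothetical counts in four angular intervals, for illustration only.
observed_counts = [22.0, 18.0, 25.0, 15.0]
expected_counts = [20.0, 20.0, 20.0, 20.0]

chi2 = chi_square(observed_counts, expected_counts)
groups = independent_groups(len(observed_counts))
```

For the 24 intervals of 1° between 1 and 25° the same bookkeeping gives the 23 independent groups quoted in the text.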

It is very difficult to account for a variation of the type suggested by these results
as a consequence of deficiencies or errors in the apparatus or the method. Any
broadening of the source, or any errors in the estimate of its location, would in
general lead to a blurring out of the angular resolving power of the method without
introducing harmonic variations of the type suggested by the results if the scattering
is in fact isotropic. It is also most unlikely that the fluctuations can have been
introduced through an unconscious prejudice on the part of the observers, because
these features of the results were only suspected a year after the conclusion of the
measurements when we found time from other duties to carry through the necessary
calculations on a large part of the experimental material. Further, checks on
the accuracy of the measurements of one observer by another, which are made
possible by the maps of the measured areas, have never disclosed any appreciable
errors in the measurements or any tendency of an observer to choose tracks
with special characteristics. It therefore seems desirable to continue the measurements
on plates exposed in similar conditions to those already employed in order
that the statistical errors may be further reduced. If the existence of these fluctuations
is then maintained, the experiment should be repeated using a neutron group
of similar energy from a different source such as is provided by the bombardment of
fluorine by deuterons. We suggest such an experiment because in general the
methods of observation hitherto available determine only the energy and momentum
of the fast neutrons emitted from nuclear reactions. It is unnecessary to stress
the importance for nuclear theory of the variations suggested by the present
experiments should they prove to be real.

Figure 14. 8.8 MeV group. Number of tracks in intervals of 3°
in the angle of scattering plotted for every degree.

12. Results for the 13 MeV group

The number of tracks produced by the 13 MeV group, which have been measured,
is only 132, so that no detailed study of the distribution of the intensity of the
scattered particles with angle is possible. The results are represented in figure 15
where we plot the number of tracks in successive angular intervals of 5°. The calculated
distribution to be expected if the scattering is in fact isotropic is shown by
the full line in this figure.

[Plot: number of tracks against angle of scattering, 0 to 40°.]

Figure 15. 13.0 MeV group. Observed distribution of scattered particles in successive
intervals of 5°. Full line gives distribution expected if scattering is isotropic.

Figure 15 shows that the present results are in accord with the results of Amaldi
and others to which reference has already been made. Thus our value for the
ratio of the intensity of the scattered neutrons in the centre of gravity system
of the colliding particles $I(\pi)/I(\pi/2)$ is 0.67 ± 0.20. This value is based on a comparison
of the expected number of protons projected at angles from 0 to 10°
and from 35 to 45° with those actually observed, and agrees to within the limits of
the statistical error with the values ranging from 0.52 to 0.71 obtained by Amaldi
for neutrons of approximately the same energy. It is clear that for this group of
neutrons also it is desirable to extend the observations in order that the statistical
error may be further reduced.

We are greatly indebted to Professor A. M. Tyndall for his interest in the progress 
of the work and for making available to us the facilities of the laboratory. Our thanks 
are also due to Mr H. Heitler for his collaboration in the earlier measurements, and 
to Miss M. Littleton for assistance in some of the calculations. 





References 

Amaldi, Bocciarelli, Ferretti & Trabacchi 1942 Naturwissenschaften, 30, 582.

Bonner & Brubaker 1936 Phys. Rev. 50, 308.

Chadwick & Goldhaber 1935 Proc. Roy. Soc. A, 151, 479.

Cockcroft & Lewis 1936 Proc. Roy. Soc. A, 154, 261.

Dee & Gilbert 1937 Proc. Roy. Soc. A, 163, 265.

Fisher & Yates 1938 Statistical tables for agricultural, biological and medical research. Oliver
and Boyd.

Hulthen 1943 Phys. Rev. 63, 383.

Massey & Buckingham 1937 Proc. Roy. Soc. A, 163, 281.

Powell & Fertel 1939 Nature, Lond., 144, 115.

Powell 1940 Nature, Lond., 145, 155.

Powell, Heitler & Champion 1940 Nature, Lond., 146, 716.

Powell 1943 Proc. Roy. Soc. A, 181, 344.

Rarita & Schwinger 1941 Phys. Rev. 59, 556.

Richards 1941 Phys. Rev. 59, 796.

Shapiro 1941 Rev. Mod. Phys. 13, 58.

Tatel 1942 Phys. Rev. 61, 450.


On the equation of state for solids 

By R. Fürth, Edinburgh

(Communicated by M. Born, F.R.S.—Received 27 October 1943) 

The problem of the thermodynamics of crystal lattices has been treated by rigorous methods 
recently in a series of papers by Born and collaborators. In particular, Bradburn succeeded
in deriving the equation of state for a solid cubic crystal, consisting of identical atoms, under
the assumption that the mutual potential energy of a pair of atoms satisfies a law of the form
$\phi = -ar^{-m} + br^{-n}$. In the present paper a method is developed which makes it possible to
determine the exponents m and n in the force law for a given element from measurements of 
the sublimation energy, the compressibility, the thermal expansion coefficient, and the 
dependence of these quantities on pressure and temperature. The method is applied to a large 
number of elements, and it is shown that the compression and the thermal expansion of 
these substances, as predicted by the theory, are in satisfactory agreement with the 
measured values of these quantities up to very high pressure and up to temperatures near 
the melting-point. The question whether melting is caused by the mechanical instability of 
the lattice is also investigated, and a certain rule connecting the two phenomena is found 
which is closely related to Lindemann’s law. 

1 

The problem of deriving the equation of state of a solid substance from its molecular
properties can be solved in two steps. First, the structure of the solid at zero temperature
and pressure can be found as the equilibrium configuration of a system of
particles under the action of the interatomic forces. Secondly, the change of volume
for finite temperatures and pressures may be obtained by applying the methods of
statistical thermodynamics. For the first task it is essential to know the correct
form of the interatomic force law. Unfortunately, the calculation of the potential
energy of an assembly of given particles as a function of their mutual distances is
a very difficult problem, and it has been solved by quantum mechanical methods in
a few cases only (see e.g. Mott & Jones 1936). If one is content, however, to assume
as given the lattice structure, the lattice constant, and the lattice energy, the second
task can be performed by supposing a fictitious force law, provided that it resembles
the real one in some general features and can be made to fit it in a narrow region
surrounding the equilibrium point; for the structure generally remains unaltered
and the density changes only slightly even at high pressure and up to the melting
temperature.



Figure 1. Mutual potential energy of two atoms as a function of their distance apart.

The potential energy $\phi$ of a pair of the constituent particles of the crystal lattice as a
function of the mutual distance $r$ between them [...] by the [...] of figure 1 [...] mainly
determined by [...] quantities, namely, the co-ordinates of the minimum point, which [...]
the equilibrium distance $r_0$ and the dissociation energy $u$, the curvature
at the minimum point and the position of the inflexion point, which determine [...]
in the neighbourhood of the equilibrium position. [...] provided the forces between nearest
neighbours [...] those parts of the curve which are farther away from the minimum point
will not essentially affect [...] as long as the general character of the curve is
preserved. [...] $\phi$ depends on the mutual distance [...] not central as, for example,
[...] of metals [...].

The most simple approximation of this kind is a function of the form

$$\phi = -ar^{-m} + br^{-n} \qquad (m < n), \tag{1}$$

[...] superposition of an attracting and a repelling force, both depending on the
distance by a simple inverse power function,
as is usually assumed in the kinetic theory of gases (Mie 1903). This force law also
satisfies the necessary condition that the lattice held together by such forces must
be mechanically stable for all kinds of small perturbations, for all values of $m$ and $n$
in the case of the cubic face-centred lattice, and for values of $m$ and $n$ smaller than 5
in the case of the body-centred cubic lattice (Born 1940; Misra 1940).

So much for the force law. The main problem in obtaining the equation of state 
is now to derive an expression for the free energy of the crystal lattice by the methods 
of statistical thermodynamics. For if this quantity is known as a function of tem¬ 
perature and volume the equation of state follows by known thermodynamical 
rules. For this purpose one has to consider the lattice, consisting of N identical 
particles, as a mechanical system of 3 N degrees of freedom, capable of 3 N normal 
modes of vibration about the equilibrium configuration. If all the proper frequencies 
are known the free energy can be calculated with the help of a well-known theorem of 
statistical thermodynamics. 
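As an illustration of this last step, the sketch below sums the free energy of a set of independent harmonic oscillators, each mode contributing its zero-point energy plus kT ln(1 - exp(-ħω/kT)). Taking this standard oscillator formula to be the theorem meant here is an assumption; the function names and the energy units are likewise chosen only for the example.

```python
import math

def mode_free_energy(quantum, kT):
    """Free energy of one oscillator; `quantum` is hbar*omega in the same units as kT."""
    x = quantum / kT
    # -expm1(-x) equals 1 - exp(-x) and stays accurate for small x.
    return 0.5 * quantum + kT * math.log(-math.expm1(-x))

def lattice_free_energy(quanta, kT):
    """Vibrational free energy as a sum over all the proper frequencies supplied."""
    return sum(mode_free_energy(q, kT) for q in quanta)

# High-temperature check for a single mode: the result approaches kT * ln(quantum / kT).
kT = 100.0
single_mode = lattice_free_energy([1.0], kT)
classical_limit = kT * math.log(1.0 / kT)
```

For a crystal of N atoms the list passed in would hold the 3N proper frequencies; the Einstein and Debye models discussed next amount to particular choices of that list.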

The realization of this programme is, of course, not a simple task and attempts
have been made to simplify it. One way to achieve this consists in replacing all the
3N proper frequencies by a single one by adopting the 'Einstein model' of a solid
in which each particle is supposed to vibrate about its equilibrium position
independently of the motion of the others (Grüneisen 1912), or by adopting the
'Debye model' where the solid is considered as a continuum with a continuous
spectrum of proper frequencies up to an upper limit, the 'Debye frequency' (Debye
1914).

An extensive investigation was carried out by Grüneisen in order to check the
results of Mie's (1903) and his own theory of the equation of state with the empirical
facts, and he was able to show that certain rules connecting the sublimation energy,
the compressibility, the coefficient of thermal expansion, etc., which follow from the
theory were roughly satisfied by a great number of substances (see e.g. Grüneisen
1926). It must be expected, however, that a theory based on the assumption of a
single characteristic frequency or of a continuous spectrum of frequencies will never
yield more than a rough approximation, as these assumptions are not compatible
with the crystal structure of the solid body which is an essential feature of the solid
state.

The general methods for solving the problem rigorously are mainly due to Born
(see e.g. Born 1923) and Stern (1916). The first attempt to derive the equation of
state and expressions for the elastic constants in the limiting case of sufficiently high
temperatures (i.e. high compared with the Debye temperature) is also due to Born
(1939). He supposed a monatomic substance with a body-centred lattice and a force
law of the type (1) with exponents $m = 6$ and $n = 12$, a combination which seemed
justified both by theoretical considerations and because it had been found to fit
the observations on rare gases (Lennard-Jones 1937). A comparison of the main
results of this theory with the experiments on similar lines as in Grüneisen's work
has been carried out elsewhere (Fürth 1941a, b). But here again only a comparatively
rough agreement could be expected, as considerable simplifications had to be
introduced in the course of the theoretical calculations, and furthermore the assumption
of a general force law with the above-mentioned values for the exponents
for all substances could hardly be justified; also the body-centred lattice is not
mechanically stable under these conditions.

The work was therefore resumed by Bradburn (Born & Bradburn 1943; Bradburn
1943), who succeeded in solving the problem rigorously for a face-centred cubic
lattice; she also gave the equation of state in a numerical form for the same force
law. She showed further that by making certain alterations in the averaging processes
a comparatively simple formula could be derived for the equation of state of
a crystal with a cubic lattice and for arbitrary exponents in the force law, which
coincides practically with the rigorous result in the case of $m = 6$ and $n = 12$, so
that this formula can be reasonably supposed to represent the equation of state of
a large variety of substances. For a number of combinations of values of $m$ and $n$
sets of curves representing the isobars and isotherms according to this theory have
been drawn by Gow, who has also extended the theory on the same lines to the
calculation of the elastic coefficients of cubic crystals under the same conditions,
and who has drawn curves showing the dependence of these quantities on temperature
and pressure (Gow 1944).

With all this material available it seemed worth while to resume once again the 
above-mentioned investigation in order to find out whether the theoretical equation 
of state can be made to fit the observations and whether the constants of the force 
law can be determined individually for each substance in this way. The present 
paper is devoted to this task. 

2 

As mentioned in § 1 the theory is restricted to a monatomic substance where the
particles are arranged in a face-centred cubic lattice. Let $N$ be the number of atoms
per mol., $V$ the specific volume per mol., and $V_0$ the same quantity for zero temperature
and zero pressure.

The potential energy $\phi$ of an isolated pair of particles of the type (1) is conveniently
written in the form

$$\phi(r) = \frac{umn}{n-m}\left[-\frac{1}{m}\left(\frac{r_0}{r}\right)^{m} + \frac{1}{n}\left(\frac{r_0}{r}\right)^{n}\right], \tag{2}$$

where $r_0$ is the equilibrium distance between the particles and $u$ their dissociation
energy.
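A short numerical sketch of a pair potential of this type may make the roles of the equilibrium distance and the dissociation energy concrete. It assumes the form φ(r) = [umn/(n - m)] [-(r0/r)^m / m + (r0/r)^n / n], which has its minimum -u at r = r0; the function names and numerical values are illustrative only.

```python
def pair_potential(r, u, r0, m, n):
    """Inverse-power pair potential with minimum value -u at r = r0 (requires m < n)."""
    x = r0 / r
    return u * m * n / (n - m) * (-(x ** m) / m + (x ** n) / n)

def force_law_constants(u, r0, m, n):
    """Constants a, b of the equivalent form phi = -a * r**(-m) + b * r**(-n)."""
    a = u * n * r0 ** m / (n - m)
    b = u * m * r0 ** n / (n - m)
    return a, b

# Illustrative values only: unit equilibrium distance, depth 2, exponents 6 and 12.
u, r0, m, n = 2.0, 1.0, 6, 12
depth = pair_potential(r0, u, r0, m, n)
a, b = force_law_constants(u, r0, m, n)
```

Here `depth` equals -u, and the potential rises on either side of r0, so the two parameters fix the position and depth of the minimum while m and n fix the curvature and the position of the inflexion point.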

It is further convenient to write the equation of state in a reduced form, that is,
to introduce dimensionless parameters instead of the volume $V$, the pressure $p$
and the temperature $T$ by means of

$$v = V/V_0, \qquad \varpi = p/p_0, \qquad \tau = T/\theta, \tag{3}$$

where the constants $p_0$ and $\theta$ of the dimensions of pressure and temperature respectively
are defined by

$$Nu = R\theta = p_0V_0. \tag{4}$$

The equation of state can now be written in the form

$$\varpi v = F + Q\tau, \tag{5}$$

where $F$ and $Q$ are functions of volume but are independent of temperature. These
functions can more easily be expressed by introducing a new parameter $\xi$ instead
of $V$, defined by

$$1 + \xi = v^{-s/3}, \tag{6}$$

where

$$s = n - m. \tag{7}$$

$F$ and $Q$ as functions of $\xi$ are now defined by

$$F(\xi) = a\xi(1+\xi)^{m/s}, \tag{8}$$

$$Q(\xi) = c + b\,\frac{1+\xi}{1+K\xi}, \tag{9}$$


where a, b, c, K are constants depending only on the lattice structure and the 
exponents m and n, according to the following definitions: 



b = Ks/2, 


( 11 ) 



^ r ^ I ^ +m ^ +2 /^^ +2+s 


( 12 ) 

(13) 


Here the quantities $S_p$ are simple lattice sums which have been calculated and
tabulated for all integer values of $p$ between 4 and 15 for all three types of cubic
Bravais lattices by Misra (1940).

With the help of the relations (5)-(13) $\varpi$ can be calculated as a function of $\xi$
for any pair of values of $m$ and $n$, and finally from this the equation of state can be
obtained in the usual way, giving the reduced volume $v$ as a function of the reduced
pressure $\varpi$ and the reduced temperature $\tau$. As an illustration the isobars $v(\tau)$ for
different values of $\varpi$ for the exponents $m = 6$ and $n = 12$ are shown in figure 2,
taken from Bradburn's paper.

The potential energy $\Phi$ of the lattice which is equal and opposite in sign to the
heat of sublimation $\Lambda$ per mol. of the substance at zero temperature is given by

$$-\Phi = \Lambda = qNu, \tag{14}$$

where

$$q = \frac{1}{2}\,\frac{(S_m)^{(s+m)/s}}{(S_{s+m})^{m/s}}. \tag{15}$$

From (10) and (15) follows

$$\frac{q}{a} = \frac{3s}{m(s+m)}. \tag{16}$$

For the following calculations it is finally convenient to introduce the quantities

$$g = b + c = 1 + \frac{m + Ks}{2}, \tag{17}$$

$$h = bK + c = 1 + \frac{m + K^2s}{2}. \tag{18}$$


In order to carry out the proposed comparison of the theoretical equation of
state with measurements it is useful first to derive formulae for the compressibility $\kappa$,
the coefficient of thermal expansion $\alpha$, and the dependence of these quantities on
pressure and temperature. This can be done by calculating the first and second
derivatives of $v$ with respect to $\varpi$ and $\tau$. As under normal conditions $p$ and $T$ are
small compared with $p_0$ and $\theta$, and $V$ is only slightly bigger than $V_0$, the quantities
$\varpi$, $\tau$, $\xi$ are small compared with unity. The simplest procedure is therefore to expand
the function $v(\varpi, \tau)$ in a power series up to the second order.

Figure 2. Theoretical isobars for a cubic face-centred lattice with $m = 6$, $n = 12$.

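Using the relations (5), (6), (8) and (9), one obtains

$$\varpi = a\xi + \frac{a}{s}(m+3)\xi^2 + (b+c)\tau + \left[\frac{3}{s}(b+c) + b(1-K)\right]\xi\tau. \tag{19}$$

The solution of this equation with respect to $\xi$, to the same degree of approximation, is

$$\xi = \frac{1}{a}\left[\varpi - (b+c)\tau\right] + A\varpi^2 + B\varpi\tau + C\tau^2, \tag{20}$$

where

$$A = -\frac{m+3}{a^2s}, \qquad B = \frac{1}{a^2}\left[\frac{2m+3}{s}(b+c) - b(1-K)\right], \qquad C = -\frac{b+c}{a^2}\left[\frac{m}{s}(b+c) - b(1-K)\right]. \tag{21}$$

From (6) and (20) one obtains further, again correct up to the second order,

$$v = 1 - \frac{3}{as}\left[\varpi - (b+c)\tau\right] - \varpi^2\left[\frac{3A}{s} - \frac{3}{2sa^2}\left(\frac{3}{s}+1\right)\right] - \varpi\tau\left[\frac{3B}{s} + \frac{3}{sa^2}\left(\frac{3}{s}+1\right)(b+c)\right] - \tau^2\left[\frac{3C}{s} - \frac{3}{2sa^2}\left(\frac{3}{s}+1\right)(b+c)^2\right]. \tag{22}$$

Substituting here for $A$, $B$, $C$ from (21) one gets with the help of (17) and (18)

$$v = 1 - \frac{3}{as}\left[\varpi - g\tau\right] + \varpi^2\,\frac{3(9+2m+s)}{2s^2a^2} - \varpi\tau\,\frac{3}{sa^2}\left[h + \frac{2(3+m)}{s}g\right] + \tau^2\,\frac{3g}{2sa^2}\left[2h + \frac{3+2m-s}{s}g\right], \tag{23}$$

which is the required reduced equation of state for small values of pressure and
temperature. From (23) by differentiation

$$\frac{\partial v}{\partial\varpi} = -\frac{3}{as} + \frac{3(9+2m+s)}{s^2a^2}\varpi - \frac{3}{sa^2}\left[h + \frac{2(3+m)}{s}g\right]\tau,$$

$$\frac{\partial v}{\partial\tau} = \frac{3g}{as} + \frac{3g}{sa^2}\left[2h + \frac{3+2m-s}{s}g\right]\tau - \frac{3}{sa^2}\left[h + \frac{2(3+m)}{s}g\right]\varpi, \tag{24}$$

and hence for the limiting case $\varpi = 0$, $\tau = 0$

$$\left(\frac{\partial v}{\partial\varpi}\right)_0 = -\frac{3}{as}, \qquad \left(\frac{\partial v}{\partial\tau}\right)_0 = \frac{3g}{as}, \qquad \left(\frac{\partial v}{\partial\tau}\Big/\frac{\partial v}{\partial\varpi}\right)_0 = -g,$$

$$\left(\frac{\partial^2 v}{\partial\varpi^2}\right)_0 = \frac{3(9+2m+s)}{s^2a^2}, \qquad \left(\frac{\partial^2 v}{\partial\tau^2}\right)_0 = \frac{3g}{sa^2}\left[2h + \frac{3+2m-s}{s}g\right], \qquad \left(\frac{\partial^2 v}{\partial\varpi\,\partial\tau}\right)_0 = -\frac{3}{sa^2}\left[h + \frac{2(3+m)}{s}g\right]. \tag{25}$$

The compressibility $\kappa$, the thermal expansion coefficient $\alpha$ and their derivatives
with respect to pressure and temperature can, in virtue of (3) and (4), be expressed
in terms of the above quantities as follows:

$$\kappa_0 = -\frac{1}{V_0}\left(\frac{\partial V}{\partial p}\right)_0 = -\frac{1}{p_0}\left(\frac{\partial v}{\partial\varpi}\right)_0, \qquad \alpha_0 = \frac{1}{V_0}\left(\frac{\partial V}{\partial T}\right)_0 = \frac{1}{\theta}\left(\frac{\partial v}{\partial\tau}\right)_0,$$

$$-\frac{1}{\kappa_0^2}\left(\frac{\partial\kappa}{\partial p}\right)_0 = \left[\frac{\partial^2 v}{\partial\varpi^2}\Big/\left(\frac{\partial v}{\partial\varpi}\right)^2\right]_0, \qquad \frac{1}{\alpha_0^2}\left(\frac{\partial\alpha}{\partial T}\right)_0 = \left[\frac{\partial^2 v}{\partial\tau^2}\Big/\left(\frac{\partial v}{\partial\tau}\right)^2\right]_0,$$

$$\frac{1}{\alpha_0\kappa_0}\left(\frac{\partial\kappa}{\partial T}\right)_0 = -\frac{1}{\alpha_0\kappa_0}\left(\frac{\partial\alpha}{\partial p}\right)_0 = \left[\frac{\partial^2 v}{\partial\varpi\,\partial\tau}\Big/\left(\frac{\partial v}{\partial\varpi}\,\frac{\partial v}{\partial\tau}\right)\right]_0. \tag{26}$$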
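The second-order expansion can be checked numerically against the unexpanded equation of state. The sketch below is a reading of the formulae under stated assumptions: the reduced pressure multiplied by v equals F + Qτ with 1 + ξ = v^(-s/3), F = aξ(1+ξ)^(m/s), Q = c + b(1+ξ)/(1+Kξ), b = Ks/2, and c = 1 + m/2 (the last inferred from the definitions of g and h). The constants a and K are obtained from the tabulated values 0.01452 and 9.0420 of the first and third numerical columns of table 1 for m = 6, s = 6; all function names are illustrative.

```python
def reduced_constants(m, s, K):
    """Return b, c, g, h; c = 1 + m/2 is an inferred value, not quoted from the paper."""
    b = K * s / 2.0
    c = 1.0 + m / 2.0
    return b, c, b + c, b * K + c

def exact_state(xi, tau, m, s, a, K):
    """Reduced pressure and reduced volume from the unexpanded equation of state."""
    b, c, _, _ = reduced_constants(m, s, K)
    v = (1.0 + xi) ** (-3.0 / s)
    F = a * xi * (1.0 + xi) ** (m / s)
    Q = c + b * (1.0 + xi) / (1.0 + K * xi)
    return (F + Q * tau) / v, v

def expanded_volume(w, tau, m, s, a, K):
    """First-order and second-order approximations to v(w, tau)."""
    _, _, g, h = reduced_constants(m, s, K)
    first = 1.0 - 3.0 / (a * s) * (w - g * tau)
    second = (
        3.0 * (9 + 2 * m + s) / (2.0 * s**2 * a**2) * w**2
        - 3.0 / (s * a**2) * (h + 2.0 * (3 + m) * g / s) * w * tau
        + 3.0 * g / (2.0 * s * a**2) * (2.0 * h + (3 + 2 * m - s) * g / s) * tau**2
    )
    return first, first + second

m, s = 6, 6
a = 3.0 / (s * 0.01452)              # from the tabulated value of 3/(s*a)
K = (2.0 * (9.0420 - 1.0) - m) / s   # from the tabulated value of g
w, v_exact = exact_state(0.01, 0.001, m, s, a, K)
v_first, v_second = expanded_volume(w, 0.001, m, s, a, K)
```

For this small compression the second-order value differs from the exact one only in the sixth decimal place, while the first-order value is off in the fourth, which is the behaviour expected of an expansion correct to the second order.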



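Substituting the expressions (25) in (26) one finally obtains the following relations:

$$p_0\kappa_0 = 3/sa, \tag{27}$$

$$\theta\alpha_0 = 3g/sa, \tag{28}$$

$$V_0\alpha_0/R\kappa_0 = g, \tag{29}$$

$$-\frac{1}{\kappa_0^2}\left(\frac{\partial\kappa}{\partial p}\right)_0 = \frac{9+2m+s}{3} = \frac{9+m+n}{3}, \tag{30}$$

$$\frac{1}{\alpha_0^2}\left(\frac{\partial\alpha}{\partial T}\right)_0 = \frac{1}{3}\left[\frac{2sh}{g} + (3+2m-s)\right], \tag{31}$$

$$\frac{1}{\alpha_0\kappa_0}\left(\frac{\partial\kappa}{\partial T}\right)_0 = -\frac{1}{\alpha_0\kappa_0}\left(\frac{\partial\alpha}{\partial p}\right)_0 = \frac{1}{3}\left[\frac{sh}{g} + 2(3+m)\right]. \tag{32}$$

Equation (30) has been derived by Grüneisen (1912) from his theory, and later
independently by Bridgman (1923). More recently its correctness has been proved
rigorously for the cubic face-centred lattice at zero temperature (Fürth 1942).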

It has been further shown by Grüneisen (1912) on the basis of his theory that the
quantity

$$\gamma = \frac{V\alpha}{\kappa c_v}, \tag{33}$$

where $c_v$ is the specific heat at constant volume, should be independent of temperature
for each substance; it is therefore usually called the 'Grüneisen constant'.
Its value should depend on the values of the exponents in the atomic force law. It
has been especially suggested by Grüneisen (1926) that $\gamma$ might depend only on the
exponent $n$ of the repulsive force according to the simple formula

$$\gamma = \frac{n+2}{6}. \tag{34}$$

The calculation of $\gamma$ from (33) by means of the measured values of the quantities on
the right-hand side of this formula for a large variety of elements (Grüneisen 1926)
gave the result that $\gamma$ is not only constant for each substance separately but has also
roughly the same value (~2.4) for all the elements with a few exceptions. [...] the
value $n \approx 12$ for the exponent of the repulsive force, which [...] has been used in
many [...]. [...] temperatures where the relation

$$c_v = 3R \tag{35}$$

is valid, (33) becomes identical with (29) by putting

$$\gamma = g/3. \tag{36}$$

As, however, $g$ depends on $m$ as well as on $n$, the use of formula (34) for calculating
$n$ becomes unjustified and also the conclusion that the repulsive force has nearly
the same character for most substances.
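The dependence on the attractive exponent can be seen by evaluating the two expressions for the Grüneisen constant side by side. In the sketch below the values of g are those tabulated in the third numerical column of table 1 for the four pairs of exponents with n = 12; the expression (n + 2)/6 is the simple formula attributed to Grüneisen, and the function names are illustrative.

```python
def gamma_from_g(g):
    """Grueneisen constant from the lattice theory, gamma = g / 3."""
    return g / 3.0

def gamma_from_n(n):
    """Simple formula depending on the repulsive exponent only, gamma = (n + 2) / 6."""
    return (n + 2) / 6.0

# g for n = m + s = 12, keyed by m (values from table 1).
g_for_n12 = {4: 7.7420, 6: 9.0420, 8: 10.204, 10: 11.285}

gammas = {m: gamma_from_g(g) for m, g in g_for_n12.items()}
simple_value = gamma_from_n(12)
```

With n fixed at 12 the simple formula gives a single value near 2.33, whereas g/3 runs from about 2.58 for m = 4 to about 3.76 for m = 10, so a measured constant alone does not determine n.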

A relation similar to equation (32) has also been derived by Grüneisen (1912),
namely,

$$\frac{1}{\alpha\kappa}\frac{\partial\kappa}{\partial T} = \tfrac{1}{3}(m+n+6), \tag{37}$$

which coincides with the correct expression (32) only for $m = 3$.

The right-hand sides of the equations (27)-(32) depend only on the exponents
$m$ and $n$ and can therefore be calculated for any pair of values of $m$ and $n$ with the
help of the formulae (10), (13), (17), (18) and Misra's tables of the lattice sums $S_p$.
The results of this calculation are contained in table 1. The lattice sums are finite
only for values of $p$ greater than 3; nevertheless, it can be seen directly that the
expression on the right-hand side of (30) remains finite for all finite values of $m$
and $n$ and that it is a linear function of these quantities. The numerical calculation
shows further that the same applies to the quantities on the right-hand sides of
(29), (31) and (32), and that they are practically linear functions of $s$ for any fixed
value of $m$ or $n$ inside the range accessible to numerical calculation. It is therefore
easy to extrapolate the values of these quantities for values of $m$ and $n$ smaller
than 3. The expressions (27) and (28), however, vanish for $m = 3$ and are not defined
for $m < 3$. They also contain the unknown quantities $p_0$ and $\theta$.
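The internal consistency of table 1 can be checked without the lattice sums, since K follows from the tabulated g. The sketch below assumes the readings g = 1 + (m + Ks)/2 and h = 1 + (m + K²s)/2, and takes the right-hand sides of (31) and (32) as [2sh/g + (3 + 2m - s)]/3 and [sh/g + 2(3 + m)]/3; the function name is illustrative.

```python
def derived_columns(m, s, g):
    """Right-hand sides of (31) and (32) computed from the tabulated value of g."""
    K = (2.0 * (g - 1.0) - m) / s            # invert g = 1 + (m + K*s)/2
    h = 1.0 + (m + K**2 * s) / 2.0           # h = 1 + (m + K^2*s)/2
    alpha_column = (2.0 * s * h / g + (3 + 2 * m - s)) / 3.0
    kappa_column = (s * h / g + 2.0 * (3 + m)) / 3.0
    return alpha_column, kappa_column

# Two rows of table 1: (m, s, g) with the tabulated fourth and fifth columns.
row_6_6 = derived_columns(6, 6, 9.0420)     # table gives 8.5178 and 8.7589
row_4_2 = derived_columns(4, 2, 4.6297)     # table gives 4.6289 and 5.4811
```

Both rows are reproduced to about three decimal places, which supports the reading of (17), (18), (31) and (32) used here.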

In order to obtain relations for comparing the theoretical and the measured values
of the thermal expansion coefficient and the compressibility one can eliminate the
quantities $p_0$ and $\theta$ from (27) and (28) with the help of (4) and (14), introducing
instead the sublimation energy $\Lambda$. One has

$$\Lambda = qR\theta = qp_0V_0, \tag{38}$$

which combined with (27) gives

$$\kappa_0\Lambda = \frac{3qV_0}{sa}, \tag{39}$$

which again by means of (16) can be transformed into

$$\kappa_0\Lambda = \frac{9V_0}{m(s+m)} = \frac{9V_0}{mn}. \tag{40}$$

This relation has been derived by Grüneisen (1912) and by Born & Landé (1918),
and it is generally valid for all values of $m$ and $n$.

Similarly one obtains instead of (28)

$$\alpha_0\Lambda = \frac{3qgR}{sa}, \tag{41}$$

which by means of (16) is transformed into

$$\alpha_0\Lambda = \frac{9gR}{m(s+m)} = \frac{9gR}{mn}. \tag{42}$$

This formula, of course, provides no new information, as it is a direct consequence
of (29) and (40).



96 


R. Furth. 

Table 1 






V qOCq 

1 (fa\ 

i /a/c > 

\ 


m 

s 

Po*o 

6cCq 

Hkq 

*1 Uvo 

a 0 /c 0 \dTj 

I o A/Tc 

vjv 0 

4 

1 

0-00705 

0-0288 

4*0905 

4-2199 

5-1049 

11*49 

1*405 

4 

2 

0-00963 

0-0446 

4-6297 

4*6289 

5-4811 

14-49 

1*299 

4 

3 

0*01080 

0-0557 

5*1579 

5-0336 

5-8501 

17-28 

1*238 

4 

4 

0*01122 

0-0637 

5-6812 

5*4286 

6-2143 

19-94 

1*198 

4 

5 

0-01120 

0-0695 

6*2010 

5-8157 

6-5412 

22-54 

1*170 

4 

6 

0-01098 

0-0738 

6*7173 

6*1959 

6*9313 

25-06 

1*148 

4 

7 

0*01063 

0-0769 

7-2304 

6-5699 

7*2850 

27-53 

1*131 

4 

8 

0-01024 

0-0793 

7-7420 

6-9392 

7-6363 

29*97 

1*119 

4 

9 

0-00983 

0-0811 

8*2520 

7-3047 

7-9857 

32*38 

1-108 

4 

10 

0-00943 

0-0826 

8*7615 

7*667 

8*334 

34*77 

1*098 

5 . 

1 

0-01586 

0-0830 

5*2307 

5*2095 

5-9381 

13*52 

1*280 

5 

2 

0-01667 

0-0958 

5-7799 

5*6731 

6*3365 

15*97 

1-221 

5 

3 

0*01658 

0-1048 

6-3189 

6-1178 

6-7256 

18-27 

1*184 

5 

4 

0*01608 

0-1102 

6*8506 

6*5475 

7*10*71 

20-46 

1-158 

5 

5 

0-01539 

0-1135 

7*3757 

6-9639 

7*4819 

22-57 

1*140 

5 

6 

0-01465 

0-1157 

7*8980 

7-3713 

7*8523 

24-63 

1-125 

5 

7 

0*01391 

0-1171 

8-4161 

7-7695 

8-2181 

26-65 

1*113 

5 

8 

0*01320 

0-1179 

8-9300 

8-1610 

8*5805 

28*62 

1*103 

5 

9 

0-01255 

0-1186 

9-4465 

8-5473 

8-940 

30-60 

1-095 

6 

1 

0-01849 

0-1178 

6-3730 

6-2631 

6-7982 

15-01 

1*208 

 6   2   0.01803   0.1248    6.9239    6.7499    7.2083   17.05   1.174
 6   3   0.01723   0.1286    7.4641    7.2153    7.6077   18.96   1.151
 6   4   0.01631   0.1304    7.9952    7.6627    7.9980   20.77   1.133
 6   5   0.01540   0.1312    8.5197    8.0952    8.3809   22.53   1.121
 6   6   0.01452   0.1313    9.0420    8.5178    8.7589   24.24   1.108
 6   7   0.01370   0.1310    9.5600    8.9305    9.1319   25.92   1.099
 6   8   0.01294   0.1304   10.081     9.340     9.503    27.59   1.092
 7   1   0.01785   0.1339    7.5012    7.3342    7.6671   16.07   1.164
 7   2   0.01691   0.1361    8.0473    7.8304    8.0819   17.77   1.144
 7   3   0.01591   0.1366    8.5849    8.3065    8.4866   19.38   1.129
 7   4   0.01493   0.1360    9.1114    8.7621    8.8810   20.91   1.116
 7   5   0.01402   0.1351    9.6345    9.2052    9.2693   22.39   1.106
 7   6   0.01318   0.1338   10.154     9.6367    9.6517   23.85   1.100
 7   7   0.01241   0.1325   10.678    10.065    10.033    25.31   1.090
 8   1   0.01606   0.1383    8.6127    8.4075    8.5371   16.83   1.140
 8   2   0.01502   0.1374    9.1511    8.9057    8.9529   18.26   1.125
 8   3   0.01405   0.1360    9.6801    9.3833    9.3583   19.62   1.112
 8   4   0.01315   0.1342   10.204     9.8454    9.7560   20.93   1.103
 8   5   0.01234   0.1323   10.724    10.295    10.147    22.21   1.093
 8   6   0.01159   0.1304   11.251    10.742    10.537    23.51   1.090
 9   1   0.01404   0.1362    9.7016    9.4708    9.4021   17.34   1.110
 9   2   0.01308   0.1338   10.233     9.9683    9.8175   18.56   1.110
 9   3   0.01222   0.1315   10.757    10.448    10.224    19.74   1.100
 9   4   0.01144   0.1290   11.277    10.913    10.623    20.89   1.092
 9   5   0.01073   0.1277   11.807    11.378    11.022    22.06   1.075
10   1   0.01215   0.1309   10.771    10.522    10.261    17.67   1.100
10   2   0.01131   0.1276   11.285    11.009    10.671    18.70   1.100
10   3   0.01058   0.1249   11.807    11.491    11.079    19.74   1.090
10   4   0.00991   0.1224   12.351    11.983    11.492    20.86   1.082
11   1   0.01051   0.1243   11.824    11.563    11.115    17.90   1.095
11   2   0.00979   0.1208   12.341    12.055    11.527    18.83   1.085
11   3   0.00917   0.1183   12.896    12.571    11.952    19.89   1.082
12   1   0.00910   0.1171   12.868    12.598    11.965    18.05   1.082
12   2   0.00842   0.1132   13.443    13.145    12.406    19.10   1.080





3 

As already pointed out at the end of § 1, it is the aim of this investigation to
ascertain whether a force law of the type (1) is compatible with the properties of
solid elements. The procedure adopted was first to derive a value for the product mn
of the exponents in the force law for a particular substance from the observed values
of $\kappa_0$, $V_0$ and A by means of formula (40). If then the sum m + n can be
calculated in a similar way, both exponents can be determined separately.
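
The last step can be sketched numerically. This is an illustration and not part of the original paper: once the product mn and the sum m + n are known, m and n are the two roots of x^2 - (m + n)x + mn = 0. The function name and the sample inputs below are assumptions chosen only to show the arithmetic.

```python
import math


def exponents_from_product_and_sum(product, total):
    """Return (m, n) with m <= n, m * n == product and m + n == total.

    m and n are the two roots of x**2 - total * x + product = 0.
    Raises ValueError when the pair would not be real.
    """
    disc = total * total - 4.0 * product
    if disc < 0:
        raise ValueError("no real exponents: (m + n)^2 < 4 mn")
    root = math.sqrt(disc)
    return (total - root) / 2.0, (total + root) / 2.0


# Illustrative inputs only: mn = 28 and m + n = 11.
m, n = exponents_from_product_and_sum(28, 11)
print(m, n)
```

In practice the measured mn and m + n carry experimental error, so the roots would be rounded to the nearest plausible exponents rather than taken literally.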



Figure 3. Diagram for the determination of m + n from specific
volume, compressibility and thermal expansion.

The most direct way of doing so is to use formula (30) by applying it to
measurements of the dependence of compressibility on pressure. Further material can
be obtained by using the relations (29), (31) and (32). For this purpose sets of curves
were drawn showing the right-hand sides of these expressions as functions of m for
integer values of m + n by using the figures of table 1 and the extrapolations of these
figures mentioned in § 2. The corresponding graphs are shown in figures 3-5. With
the help of these graphs one can now derive values for m + n from measurements
of $\alpha$ and its dependence on temperature, and further from the dependence of
$\kappa$ on temperature, in such a way that they are consistent with the already
determined values of mn.

Table 2 contains the experimental data required for carrying out this task for all
the elements for which reliable measurements were available. The values of V, the
molar volume, and those of A, the sublimation heat per mol, are taken from the
Landolt-Börnstein tables (Roth & Scheel 1923-36). The values of $\kappa$, the
compressibility at low pressure, are taken from the extensive measurements of
Bridgman (1931, 1938, 1940, 1941). The limiting values of $\partial\kappa/\partial p$
were also derived from these measurements, by graphical extrapolation for the more
compressible substances and by using the two-constant empirical formula of Bridgman
for the less compressible ones; these constants had to be corrected according to a
remark by Slater (1940). The temperature coefficient of compressibility
$\frac{1}{\kappa_0}\frac{\partial\kappa}{\partial T}$ is also found from Bridgman's
experiments. Assuming that this coefficient is independent of pressure, one sees
easily that the temperature dependence of
$\frac{1}{\kappa^2}\frac{\partial\kappa}{\partial p}$ is the same as that of
$1/\kappa$; this shows the procedure for obtaining the limiting values of
$\frac{1}{\kappa^2}\frac{\partial\kappa}{\partial p}$ for zero temperature and
pressure from the data mentioned above; they are also tabulated in column 11 of
table 2.

Figure 4. Diagram for the determination of m + n from the temperature
dependence of the thermal expansion coefficient.

Figure 5. Diagram for the determination of m + n from the
dependence of compressibility on temperature.

As the present theory is based on classical statistics and consequently only holds
for temperatures above the Debye temperature, where quantum effects can be
neglected, the limiting values $\alpha_0$ of the thermal expansion coefficient are
not the actual values of this quantity at zero temperature (which according to
Grüneisen's well-known law should be infinitely small), but such values as can be
extrapolated for zero temperature from measurements of the dependence of $\alpha$ on
temperature in the region above the Debye temperature. For all substances for which
measurements of this kind were available the limiting values of $\beta_0$, the
coefficient of linear thermal expansion, have been extrapolated and tabulated in
column 7 of table 2. In order to get reliable values of the quantity
$\frac{1}{\alpha_0}\left(\frac{\partial\alpha}{\partial T}\right)_0$ in the same way
the calculations must be based on extremely accurate observations. Only the
experiments of a few of the many experimenters in this field (Austin 1932; Austin &
Pierce 1933; Eucken & Dannöhl 1934; Nix & MacNair 1941, 1942) were found to satisfy
this condition. Values of
$\frac{1}{\alpha_0}\left(\frac{\partial\alpha}{\partial T}\right)_0$ were calculated
by numerical and graphical extrapolation from the data given by these authors; for
substances where measurements had been made by different observers the averages of
the respective results were taken. The final values are given in column 8 of table 2;
they, together with the corresponding values of $\beta_0 = \tfrac{1}{3}\alpha_0$ in
column 7, were eventually used for calculating the values of
$\frac{1}{\alpha_0^2}\left(\frac{\partial\alpha}{\partial T}\right)_0$ in column 12.

The values of
$\frac{1}{\alpha_0\kappa_0}\left(\frac{\partial\kappa}{\partial T}\right)_0$,
tabulated in column 13 of table 2, are partly calculated from the corresponding
values in the preceding column and partly from the special measurements of Grüneisen
(1910). These figures show that the thermal coefficient of $\kappa$ is on the average
about five times bigger than $\alpha$, the thermal coefficient of V. As A decreases
with increasing temperature and thus has a negative temperature coefficient, it can
be reasonably assumed (within the accuracy of this investigation) that the
temperature coefficient of $V/\kappa A$ is very small. Consequently this quantity has
been calculated from the measurements of V, A and $\kappa$ at room temperature
contained in columns 2, 3 and 4; the results are shown in column 9. Since, finally,
the temperature coefficients of $\alpha$ and $\kappa$ are of the same order of
magnitude, and that of V, as already mentioned, about five times smaller, the
quantity $V\alpha/\kappa$ can also be considered as being practically independent of
temperature; the figures given in column 10 of table 2 are calculated on this
assumption.

Table 2

[Experimental data for the elements, columns 2 to 13 as described in the text. The
entries of this table are not recoverable from this copy.]

Table 3 contains the values of mn and m + n, calculated from the data of table 2
in the way indicated above, and finally, in the last two columns, the probable values
of the exponents m and n separately. The less reliable figures are bracketed. The
values for the rare gases are quoted according to Lennard-Jones (1937), derived
from measurements on the thermal properties of these substances in the gaseous
state near the critical point.

One sees that substances belonging to the same group of the periodic system mostly
have the same or very similar values of m and n. But the variation of the values
throughout is not very wide except in a few cases. This accounts for the fact that
quite reasonable results had been obtained in former papers by the present author
(Fürth 1941a, b), where he had tried to account for the mechanical and
thermodynamical properties of solids on the basis of Born's theory in its first form
(Born 1939), where fixed values of the exponents (m = 6 and n = 12) had been used.


4 

The fact that consistent values of the exponents m and n in the atomic force law
can be derived from measurements at low pressure and temperature for a large
number of substances can be regarded as evidence for the correctness of the theory.
Further evidence of a still more convincing character can be obtained by comparing
the measured dependence of volume on temperature and pressure over a wide range
with the dependence predicted by the equation of state (5) in connexion with
equations (6)-(12), when the above-mentioned values of m and n are substituted.
Ideal experimental material for this programme is provided by recent measurements
of Bridgman on the compression of elements under very high pressure up to 100,000
atm. (Bridgman 1940, 1941).

In order to carry out this task it is convenient to write equation (5) by using
(3), (16) and (38) in the form




(43) 

with 

m- m(s 3 + s m> «i+£r*. 

(44) 

Writing further

$$\delta V = V - V_0 \qquad (45)$$

for the difference between the actual volume V and the volume $V_0$ at zero pressure
and zero temperature, one has further

$$v = 1 + \delta V/V_0, \qquad (46)$$

and

$$\xi = (1 + \delta V/V_0)^{s/3} - 1. \qquad (47)$$

By using these equations together with the equations (9), (11), (12) and (13), one
can now calculate p as a function of $\delta V$ for any temperature T if m, n and A
are given.
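
As a small sketch of the change of variables in (46) and (47), the helpers below convert a volume change into the reduced volume v and the auxiliary variable, and back. The exponent s/3 is read from a damaged line of the source and should be treated as an assumption; s is taken as a plain number supplied by the caller, and all function names are illustrative.

```python
def reduced_volume(delta_v, v0):
    """Reduced volume v = 1 + dV/V0, following (46)."""
    return 1.0 + delta_v / v0


def xi_from_delta_v(delta_v, v0, s):
    """Auxiliary variable xi = v**(s/3) - 1, following the reading of (47)."""
    return reduced_volume(delta_v, v0) ** (s / 3.0) - 1.0


def reduced_volume_from_xi(xi, s):
    """Inverse relation v = (1 + xi)**(3/s)."""
    return (1.0 + xi) ** (3.0 / s)


# Round trip with illustrative numbers (not taken from the paper).
xi = xi_from_delta_v(delta_v=0.05, v0=1.0, s=6.0)
print(xi, reduced_volume_from_xi(xi, s=6.0))
```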

Table 3

Column heads: mn from $9V_0/\kappa_0 A$; m + n from
(a) $V_0\alpha_0/R\kappa_0$,
(b) $\frac{1}{\kappa_0^2}\left(\frac{\partial\kappa}{\partial p}\right)_0$,
(c) $\frac{1}{\alpha_0^2}\left(\frac{\partial\alpha}{\partial T}\right)_0$,
(d) $\frac{1}{\alpha_0\kappa_0}\left(\frac{\partial\kappa}{\partial T}\right)_0$.

                        m + n from
element   mn      (a)      (b)     (c)     (d)        m      n

Ne        -       -        -       -       -          6      12
A         -       -        -       -       -          6      12
Kr        -       -        -       -       -          6      12
Xe        -       -        -       -       -          6      12
Li        9       7-8      10      -       8          1.5    6
Na        12      9        6       -       9          2      6
K         12.5    9        9       -       12         2      6
Rb        11      8-9      (9)     -       -          2      6
Cs        12      8-9      (15)    -       -          2      6
Cu        26      12-13    9       13      18-22      4      7
Ag        32      14-15    17      11      15-18      4.5    7
Au        42      18-19    (27)    13      -          5.5    8
Be        24      8        15      -       12-14      4      6
Mg        27      9-10     (12)    -       -          4      6
Ca        22      11       10      -       13-15      4      6
Sr        22      -        5       -       -          (4)    (6)
Ba        19      -        5       -       -          (4)    (6)
Zn        37      12-13    (2)     -       -          5      7
Cd        42      11       (6)     -       (14-17)    6      7
Hg        51      12-13    (6)     -       -          6      8
B         (30)    8        (12)    -       -          (4)    (7)
Al        24      12-13    9       (19)    19-24      4      7
Tl        34      15-16    (12)    -       -          4      9
C         27      8-9              -       -          (4)    (6)
Si        92      11-12    -       -       -          (8)    (12)
Sn        29      12-13    9       -       19-24      3      10
Pb        36      16-17    (11)    (21)    13-15      3      12
As        16      9        (10)    -       -          (2)    (8)
Sb        27      (5-6)    (9)     -       9          (4)    (7)
Bi        -       (8-9)    12      -       9          (4)    (7)
Cr        33      8        (12)    -       ?          5      7
Mo        37      11       -       ?       8          5      7
W         34      11-12    6       ?       ?          5      7
Se        13      -        9       -       -          3 4    5
Mn        29      14-15    (34)    -       15-18      3      10
Fe        27      11-12    6       -       19-22      4      7
Co        -       12-13    9       -       18-22      4      7
Ni        27      13-14    9       18-22   11-13      4      7
Pt        43      16-17    16      11      18-22      5.5    8




The quantity measured and tabulated in Bridgman's publications is the actual
compression $\Delta V$, that is, the difference between the volume $V'$ at zero
pressure and the volume V at pressure p, both at the same temperature T, divided by
$V'$, which, as the difference between $V'$ and $V_0$ is small compared with $V_0$,
can be written in the form

$$\Delta V = \frac{V' - V}{V'} \approx \frac{V' - V}{V_0},$$

or

$$\Delta V = \frac{\delta V'}{V_0} - \frac{\delta V}{V_0}, \qquad (48)$$

where

$$\delta V' = V' - V_0. \qquad (49)$$

As according to (45) and (49) $\delta V = \delta V'$ for p = 0, $\delta V'$ can be
easily found from the above calculations as the value of $\delta V$ for which p
becomes zero. This makes it possible, by means of (49), to calculate p as a function
of $\Delta V$, and so to draw a curve representing the theoretical dependence of
$\Delta V$ on p which can be directly compared with the experimental results.
Finally, the theoretical value of $\delta V'$ can be compared with the experimental
value of this quantity derived from observations on the thermal expansion at zero
pressure.
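
The procedure just described can be sketched as follows. The sketch assumes that (48) reads as the difference between the zero-pressure volume change and the volume change under pressure, both divided by the zero-temperature volume. `toy_pressure` is a hypothetical stand-in for the theoretical isotherm, which in the paper comes from equation (5) and is not reproduced here. A plain bisection locates the volume change at which p vanishes.

```python
def zero_pressure_offset(pressure, lo, hi, tol=1e-12):
    """Bisection for the volume change at which pressure(dV) == 0 on [lo, hi]."""
    p_lo, p_hi = pressure(lo), pressure(hi)
    if p_lo * p_hi > 0:
        raise ValueError("pressure does not change sign on the interval")
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        p_mid = pressure(mid)
        if p_lo * p_mid <= 0:
            hi = mid              # root lies in the lower half
        else:
            lo, p_lo = mid, p_mid  # root lies in the upper half
    return 0.5 * (lo + hi)


def compression(delta_v_prime, delta_v, v0):
    """Compression in the assumed form of (48): (dV' - dV) / V0."""
    return (delta_v_prime - delta_v) / v0


def toy_pressure(delta_v):
    """Hypothetical linear isotherm, used only to exercise the code."""
    return -2.0 * (delta_v - 0.05)


dv_prime = zero_pressure_offset(toy_pressure, 0.0, 0.2)
print(dv_prime, compression(dv_prime, -0.10, 1.0))
```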

The calculations were carried out for all substances for which values of the
exponents m and n had been determined and for which, at the same time,
measurements of the compression up to sufficiently high pressures had been performed
without showing any indication of a 'pressure transformation' of the lattice. The
results are shown in figures 6-10 and in table 4. It can be stated that the agreement
between theory and experiment is in general very satisfactory as regards the
compressions up to the highest pressures at different temperatures, and the thermal
expansion at zero pressure.


5 

Figure 2 shows that each isobar extends up to a certain critical temperature $T_c$
where it turns over. This indicates that by reaching this temperature the lattice
becomes unstable, and hence the upper branch of the isobar has no real significance.
One is tempted to identify $T_c$ with the melting-point $T_m$ and so to explain the
process of fusion as the breaking up of the lattice by the thermal movement. This
idea was first expressed by Herzfeld & Goeppert-Mayer (1934), and later Born derived
the general conditions for the stability of a crystal lattice and showed that melting
criteria could be expressed in the form of certain conditions for the elastic
constants, one of which is the vanishing of the rigidity modulus: $c_{44} = 0$
(Born 1939). In the cases investigated numerically by Gow (1944), the sharpest of
these instability criteria always practically coincides with the criterion
$T_m = T_c$.

There are, however, two strong objections against this simple explanation of
melting. First, if it were correct, the coefficient of thermal expansion $\alpha$,
which is given by the slope of the isobars, would increase very rapidly on
approaching the melting-point and become infinitely large at this point itself.
Observation shows that the actual increase of $\alpha$ with temperature is slight
even up to $T_m$. Secondly, it has been shown by the present author in a former paper
(Fürth 1941a) that the value of $\delta V_c/V_0$, the relative increase of volume
over its zero point value, calculated for the turning-point of the isobar p = 0
(according to the original theory of Born) is at least twice as big as the actually
observed volume increase in a large number of cases.

Figure 6. Compression curves of K and Li (× measured by Bridgman 1923,
+ measured by Bridgman 1941). T = 300° K.

Figure 7. Compression curves of Na and Rb (× measured by Bridgman 1923,
+ measured by Bridgman 1941). T = 300° K.

Table 4

element          $\delta V'/V_0$ (theor.)   $\delta V'/V_0$ (exp.)

Li (T = 300°)        0.050                  0.050
Na (T = 300°)        0.063                  0.058
K  (T = 300°)        0.073                  0.072
Rb (T = 300°)        0.082                  0.079
Ca (T = 300°)        0.028                  0.025
Zn (T = 300°)        0.033                  0.026
Cd (T = 300°)        0.030                  0.027
Tl (T = 300°)        0.024                  0.017
Sn (T = 300°)        0.026                  0.021
Sb (T = 200°)        0.014                  0.005
Sb (T = 300°)        0.025                  0.008
Se (T = 200°)        0.033                  0.031
Se (T = 300°)        0.050                  0.046

Figure 8. Compression curves of Cd and Zn (+ measured by Bridgman 1941). T = 300° K.

Figure 9. Compression curves of Tl and Sn (+ measured by
Bridgman 1938 and 1941). T = 300° K.

Figure 10. Compression curves of Se and Sb for different temperatures (Bridgman 1940:
+ measured at room temperature, 300° K; × measured at CO$_2$ temperature, 200° K).

It seems worth while to take up this question again on the basis of the new
equation of state. The turning-point of the isobar p = 0 is determined by the
condition

$$\left(\frac{\partial v}{\partial T}\right)_{p=0} = \infty, \qquad (50)$$

which, in virtue of ( 6 ), can be written in the form 


This leads, with the help of (5), ( 8 ) and (9), to the equation 

{l+K£ c )[{b+c) + {b + Kc)£ c }(l + ^£^-b£ c {l + £ c ){l-K) = <) (52) 

for the value $\xi_c$ of $\xi$ corresponding to $T = T_c$ and p = 0. Using (11), (12)
and (17) one can bring this equation into the form

$$C_0 + C_1\xi_c + C_2\xi_c^2 + C_3\xi_c^3 = 0, \qquad (53)$$


where 


C<> = g, 


G x = g\2K+ 


c. = k i+ 


K + -\+ 1 + 


m+Ks 


-jemim¬ 


As $\xi$ is always small compared with unity even up to $\xi_c$, a first
approximation for solving the equation (53) can be tried by neglecting all terms
except the first two. This leads to

$$\xi_c = -\left(2K + \frac{s+m}{s}\right)^{-1}. \qquad (55)$$


In the same approximation one obtains from (5) (with ru = 0 ) * 


«g e _a | 

b + c g' 


s+ml -1 




By means of (38) this expression can be transformed into

[equation (57): not legible in this copy]

and finally in consequence of (16)

[equation (58): not legible in this copy]

Similarly from (55) and (6)

$$v_c = (1 + \xi_c)^{3/s} = \left[1 - \frac{1}{2K + \dfrac{s+m}{s}}\right]^{3/s}. \qquad (59)$$


As the right-hand sides of (58) and (59) can be calculated for any pair of values
of m and n, the ratios $A/T_c$ and $V_c/V_0$ can be evaluated. They are tabulated in
the last two columns of table 1. In order to check these results the values of
$\tau_c$ obtained from (56) were compared with those values of $\tau_c$ which can be
graphically determined in the cases investigated in the paper of Gow. As in all these
cases the agreement is perfect within the accuracy of the calculation, the
approximation used can be regarded as justified. The values of $v_c$ according to
(59), however, are just about one-half of the actual values, which fact is, of
course, due to the very steep ascent of the isobars near the turning-points. The
figures in the last column of table 1 have therefore to be considered as fictitious,
corresponding to an extension of the isobars in parabolic shape from low values of
$\tau$ up to $\tau_c$. The correct values could be easily obtained from (53) and (54)
by numerical calculation, but this is hardly worth while, as $\tau_c$ is in all cases
very much larger than $\tau_m$, as will be shown immediately.

Table 5 contains in column 3 the values of $A/T_m$ for the elements, calculated from
the melting temperatures (column 2) and the sublimation energies (column 3,
table 2). It also contains in column 4 the values of $A/T_c$ determined from (58) by
using the values of m and n previously obtained and tabulated in table 3. The ratio
of these two quantities gives $T_c/T_m$, which is tabulated in column 5. It is very
nearly equal to one for the four rare gases; for all the other elements, with a few
exceptions, it is roughly constant with an approximate value of 3.5.

It appears from this fact that $T_m$ for all these substances is well inside the
region of the isobars where they can be considered to be parabolic in shape. It seems
reasonable therefore to compare the observed values of $v_m$ with the fictitious
values of $v_c$ which follow from (59). This is also shown in table 5, which contains
the observed values $\delta V_m/V_0$ in its column 6 (see Fürth 1941a) and the values
$\delta V_c/V_0$ in column 7. The ratio of these two quantities gives
$\delta V_c/\delta V_m$, which is tabulated in the last column of table 5. It is
equal to 1.6 for argon, the only rare gas for which data were available (Simon,
Ruhemann & Edwards 1930), and it is again roughly constant for nearly all the other
elements with an average of about 3.5.

The fact that the values of $T_c/T_m$ are nearly the same as those of
$\delta V_c/\delta V_m$ proves that the observed volume expansion at the
melting-point is in agreement with the present theory. The fact that these values are
roughly the same for most of the elements seems further to indicate that there might
exist a relation between the mechanical breakdown of the lattice and fusion, but also
that the two processes are certainly not identical. Melting occurs in a region where
the lattice should be stable for all disturbances caused by the thermal movement. It
is interesting to note that the ratio $T_c/T_m$ for the rare gases is exactly equal
to unity, which can hardly be a mere coincidence. On the other hand, as
$\delta V_c/\delta V_m$ is very much larger than unity in the case of argon, it
cannot be concluded that melting and mechanical instability of the lattice can be
identified even in these cases.
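
The two ratios discussed above follow directly from the tabulated columns, and the short sketch below recomputes them for a few rows. The numbers are the table 5 entries as read from this copy; the dictionary layout and the names are illustrative assumptions, and small differences from the printed ratios are to be expected from rounding.

```python
# Selected rows of table 5: A/T_m, A/T_c (cal./deg.) and the relative volume
# increases at melting and at the turning-point (both multiplied by 10^3).
ROWS = {
    "A":  {"a_tm": 24.5, "a_tc": 24.2, "dv_m": 68.0, "dv_c": 108.0},
    "Cu": {"a_tm": 60.5, "a_tc": 17.0, "dv_m": 69.0, "dv_c": 240.0},
    "Al": {"a_tm": 72.5, "a_tc": 17.3, "dv_m": 66.0, "dv_c": 240.0},
    "Pt": {"a_tm": 61.5, "a_tc": 18.0, "dv_m": 60.0, "dv_c": 180.0},
}


def ratios(row):
    """Return (T_c/T_m, dV_c/dV_m) for one row of the table."""
    tc_over_tm = row["a_tm"] / row["a_tc"]    # (A/T_m) / (A/T_c)
    dvc_over_dvm = row["dv_c"] / row["dv_m"]
    return tc_over_tm, dvc_over_dvm


for name, row in ROWS.items():
    tc_tm, dvc_dvm = ratios(row)
    print(f"{name:2s}  Tc/Tm = {tc_tm:.2f}   dVc/dVm = {dvc_dvm:.2f}")
```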


Table 5

                    $A/T_m$       $A/T_c$                  $(\delta V/V_0)_m$   $(\delta V/V_0)_c$
element   $T_m$     (cal./deg.)   (cal./deg.)   $T_c/T_m$   × 10^3               × 10^3               $\delta V_c/\delta V_m$

Ne        24.4      24.6          24.2          1.0         -                    -                    -
A         83.8      24.5          24.2          1.0         68                   108                  1.6
Kr        116       24.0          24.2          1.0         -                    -                    -
Xe        161       25.5          24.2          1.05        -                    -                    -
Li        453       79            (20)          3.9         (75)                 310                  4.1
Na        371       70            (18)          3.9         (72)                 320                  4.4
K         337       65            (18)          3.8         (81)                 320                  3.9
Rb        312       67            (18)          3.7         (82)                 320                  3.9
Cs        301       63            (18)          3.5         (81)                 320                  3.9
Cu        1356      60.5          (17)          3.5         69                   240                  3.5
Ag        1234      56            (17)          3.3         72                   225                  3.1
Au        1340      68            (18)          3.8         60                   180                  3.0
Be        1550      35            14.5          3.8         (78)                 300                  3.8
Mg        923       36.5          14.5          2.5         (75)                 300                  4.0
Ca        1080      40            14.5          2.8         (90)                 300                  3.3
Sr        1044      38            (14.5)        2.6                              300                  -
Ba        977       44.5          (14.5)        3.1         -                    300                  -
Zn        693       46            16            2.9         60                   220                  3.6
Cd        594       45.5          15            3.0         54                   210                  3.9
Hg        234       66            17            3.9         (30)                 175                  5.8
B         2500      (12)          (17)          0.7?        63                   240                  3.8
Al        932       72.5          17.3          4.2         66                   240                  3.6
Tl        576       74.5          22.5          3.3         (51)                 170                  3.3
C         3800      52            (14.5)        3.6         48                   300                  6.3?
Si        1693      55.5          (21)          2.6?        -                    100                  -
Sn        505       138           (28)          4.9         (36)                 160                  4.4
Pb        600       78            (34)          2.3         45                   120                  2.7
As        1087      25.1          (25)          1.1         (17)                 220                  13?
Sb        904       61            (17)          3.6         (25)                 240                  9.6
Bi        544       -             (17)          -           (22)                 240                  10.9
Cr        (2070)    (43)          16            2.7         (54)                 220                  4.1
Mo        2870      54            16            3.4         (54)                 220                  4.1
W         3650      55.5          16            3.4         72                   220                  3.1
Se        490       57            (12.5)        4.5         (75)                 400                  5.3
Mn        1493      46.5          (28)          1.7         (93)                 150                  1.6
Fe        1803      53            17            3.1         (69)                 240                  3.5
Co        1763      -             17            -           72                   240                  3.3
Ni        1725      56.5          17            3.3         78                   240                  3.1
Pt        2037      61.5          18            3.4         60                   180                  3.0





6 

The rule about the melting-point referred to in the last paragraph can be expressed
in the following form

$$T_m = \epsilon T_c, \qquad (60)$$

where $\epsilon$ has roughly the same value for all substances. If it is assumed that
this rule has a real physical meaning, then one can reasonably expect it to be
closely connected with another well-known rule about melting, namely Lindemann's
melting formula (Lindemann 1910). Such a connexion was first suggested by Born
(1939), and it can be shown to exist as a consequence of the present theory.

The proper frequency $\nu_0$ of the linear vibration of a pair of isolated atoms
bound together by forces obeying the law (2) is given by


4c7t% 

u — led = — Lbv §rg, 
mn u u 


(61) 


where $\mu$ is the mass of an atom. Neglecting the slight difference between $r_0$
and the actual distance between nearest neighbours in the lattice, one has further
for the
face-centred lattice / IvvM 

(62) 

(the numerical constant in the case of a body-centred lattice is practically identical).
Introducing further the atomic weight M by

$$M = \mu N, \qquad (63)$$

6 RmnN * 


one obtains from (61) and (62) = 


° “ MVi 2 Utt 2 


(64) 


Lindemann's formula contains a different atomic frequency, namely, the 'Einstein
frequency' $\nu_E$ of an atom in the lattice, vibrating under the influence of all
the other atoms. A simple consideration shows that when only first neighbours are
taken into account and the forces from the more distant atoms neglected, the two
frequencies $\nu_0$ and $\nu_E$ are connected by $\nu_E^2 = 4\nu_0^2$ in a
face-centred lattice, and by $\nu_E^2 = \tfrac{8}{3}\nu_0^2$ in a body-centred
lattice. In the first case the quantity q, defined by (14), is equal to 6, in the
second case it is equal to 4; hence both the above formulae can be replaced by the
single formula

$$\nu_E^2 = \tfrac{2}{3}\, q\, \nu_0^2. \qquad (65)$$

It will be supposed that this formula also holds approximately in the general case
of the force law (2) with q given by (15).
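
A quick check, offered only as an illustration, that the single formula (65) reproduces both special cases quoted in the text. Exact fractions avoid rounding; the function name is an assumption.

```python
from fractions import Fraction


def einstein_to_pair_ratio_squared(q):
    """Ratio of squared Einstein frequency to squared pair frequency, (2/3) q."""
    return Fraction(2, 3) * q


# Face-centred lattice: q = 6 gives 4; body-centred lattice: q = 4 gives 8/3.
print(einstein_to_pair_ratio_squared(6), einstein_to_pair_ratio_squared(4))
```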

Substituting from (64) into (65) and using the relations (3), (7), (16), (56) and
(60), one obtains

$$\nu_E^2 = C_L^2\, \frac{T_m}{M V_0^{2/3}}, \qquad (66)$$

where $C_L$ =


J\l[2K{n-m)+n^. (67) 

The relation (66) is indeed Lindemann's formula, where $C_L$ should be a universal
constant. If the atomic frequency $\nu_E$ is determined from the dependence of the
specific heat on temperature, the formula can be checked, since all the other
quantities contained in it are measurable. Such investigations have been carried out
before (see e.g. Blom 1913), and they show that the law holds with considerable
accuracy for a large number of elements, although $C_L$ is by no means strictly
constant. Its average value is $3.1 \times 10^{12}$. According to the present theory
$C_L$ should be given by formula (67). As its right-hand side depends on the
exponents m and n it cannot be a constant, which would account for the deviations
from Lindemann's law. But the range of variation of $C_L$ between the different
elements is not very large and can be easily determined from the data of the tables 3
and 4. The calculation shows that $C_L$ should be contained between
$1.8 \times 10^{12}$ and $3.3 \times 10^{12}$. The experimental value quoted above
is, indeed, within these limits.
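
As a worked illustration, Lindemann's formula in its usual form, with the frequency equal to $C_L$ times the square root of $T_m/(M V^{2/3})$, can be evaluated directly. The average $C_L$ is the value quoted above and the melting temperature of copper is taken from table 5. The atomic weight (63.5) and molar volume (7.1 cm^3) of copper are standard values that do not appear in this section and are assumptions of the sketch, as is the function name.

```python
import math


def lindemann_frequency(t_melt, atomic_weight, molar_volume, c_l=3.1e12):
    """Einstein frequency from Lindemann's formula.

    t_melt in kelvin, atomic_weight in g/mol, molar_volume in cm^3/mol;
    c_l defaults to the average value quoted in the text.
    """
    return c_l * math.sqrt(t_melt / (atomic_weight * molar_volume ** (2.0 / 3.0)))


# Copper: T_m = 1356 from table 5; M and V are assumed standard values.
nu_cu = lindemann_frequency(1356.0, 63.5, 7.1)
print(f"{nu_cu:.3e}")
```

Replacing `c_l` by the limits 1.8e12 and 3.3e12 quoted in the text brackets the frequencies that the present theory would allow for the same element.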


References

Austin, J. B. 1932 Physics, 3, 240.
Austin, J. B. & Pierce, R. H. H. Jun. 1933 Physics, 4, 409.
Blom, C. E. 1913 Ann. Phys., Lpz., 42, 1397.
Born, M. 1923 Atomtheorie des festen Zustandes. Leipzig.
Born, M. 1939 J. Chem. Phys. 7, 591.
Born, M. 1940 Proc. Camb. Phil. Soc. 36, 160.
Born, M. 1943 Proc. Camb. Phil. Soc. 39, 100.
Born, M. & Bradburn, M. 1943 Proc. Camb. Phil. Soc. 39, 104.
Born, M. & Landé, A. 1918 Verh. dtsch. phys. Ges. 20, 210.
Bradburn, M. 1943 Proc. Camb. Phil. Soc. 39, 113.
Bridgman, P. W. 1923 Proc. Amer. Acad. Arts Sci. 58, 234.
Bridgman, P. W. 1925 Proc. Amer. Acad. Arts Sci. 60, 385.
Bridgman, P. W. 1931 The physics of high pressure. London.
Bridgman, P. W. 1938 Proc. Amer. Acad. Arts Sci. 72, 207.
Bridgman, P. W. 1940 Proc. Amer. Acad. Arts Sci. 74, 21.
Bridgman, P. W. 1941 Phys. Rev. 60, 351.
Debye, P. 1914 Göttinger Vorträge über die kinetische Theorie der Materie, pp. 17ff. Leipzig.
Eucken, A. & Dannöhl, W. 1934 Z. Elektrochem. 40, 814.
Fürth, R. 1941a Proc. Camb. Phil. Soc. 37, 34.
Fürth, R. 1941b Proc. Camb. Phil. Soc. 37, 177.
Fürth, R. 1942 Proc. Roy. Soc. A, 180, 285.
Gow, M. 1944 Proc. Camb. Phil. Soc. (in the Press).
Grüneisen, E. 1910 Ann. Phys., Lpz., 33, 1239.
Grüneisen, E. 1912 Ann. Phys., Lpz., 39, 257.
Grüneisen, E. 1926 Zustand der festen Körper, Handb. d. Phys. Berlin.
Herzfeld, K. & Goeppert-Mayer, M. 1934 Phys. Rev. 46, 995.
Lennard-Jones, J. E. 1937 Physica (iv), 10, 941.
Lindemann, F. A. 1910 Phys. Z. 11, 609.
Mie, G. 1903 Ann. Phys., Lpz., 11, 657.
Misra, R. D. 1940 Proc. Camb. Phil. Soc. 36, 175.
Mott, N. F. & Jones, H. 1936 The theory of the properties of metals and alloys. Oxford.
Nix, F. C. & MacNair, D. 1941 Phys. Rev. 60, 597.
Nix, F. C. & MacNair, D. 1942 Phys. Rev. 61, 74.
Roth, W. A. & Scheel, K. 1923-36 New edition of Landolt-Börnstein, Physikalisch-chemische
Tabellen. Berlin.
Simon, F., Ruhemann, M. & Edwards, W. A. M. 1930 Z. phys. Chem. B, 6, 331.
Slater, J. C. 1940 Phys. Rev. 57, 744.
Stern, O. 1916 Ann. Phys., Lpz., (4), 51, 237.



Continuous spectra in flames: the role of 
atomic oxygen in combustion 

By A. G. Gaydon, D.Sc., Chemical Engineering Department,
Imperial College, London, S.W.7

(Communicated by Sir Alfred Egerton, Sec.R.S. Received 9 November 1943)

[Plate 2] 

The causes and types of continuous spectra emitted by flames are discussed and their
importance stressed. It is shown that the yellow-green continuous spectrum emitted by
some flames containing oxides of nitrogen is probably identical with the spectrum of
the air afterglow and is therefore due to a reaction between nitric oxide and atomic
oxygen.

It thus becomes possible to test for the presence of atomic oxygen in a flame by admitting
nitric oxide and observing if a yellow-green emission results. For the carbon monoxide flame
there appears to be a high concentration of atomic oxygen, both for the dry and moist flame.

The combustion mechanism is discussed in detail using this knowledge. For the hydrogen
flame a little atomic oxygen is present, but results do not permit of definite conclusions. For
hydrocarbon flames there is no sign of atomic oxygen in the inner cone, and this is taken as
strong evidence in favour of a peroxidation rather than a hydroxylation mechanism.

Introduction 

Of the three general classes of spectra observed in flames, line, band and continuous,
the last have received least attention. Line spectra are, of course, due to electronic
transitions between the quantized energy states of atoms; although of interest for
the purpose of chemical analysis and in a few special cases such as the sodium-line
reversal method of temperature measurement, line spectra are seldom of much
importance in elucidating details of combustion mechanisms. Band spectra in the
visible and ultra-violet regions correspond to similar transitions between quantized
energy states of molecules; the emission and absorption of band spectra by flames
have received a fair amount of attention in relation to combustion problems (for a
general review see Gaydon 1942a). Continuous spectra (in gases) correspond to
transitions in which at least one of the states involved is unquantized. Continua
therefore correspond to processes such as dissociation, ionization, and association.
Thus, while band spectra are of value in telling us what molecules and radicals are
present under flame conditions, the examination of continuous spectra might be
expected to give information of even greater value about the actual processes taking
place during the combustion. Unfortunately, in practice, continuous spectra are
difficult to identify with certainty owing to their characteristic lack of definite
features. With band spectra, analysis of the structure gives precise information
about certain molecular constants and so usually enables us to assign the bands to a
particular molecule. With continuous spectra this is not possible, and in only a few
cases, such as the well-known hydrogen continuum corresponding to the process
H$_2$($^3\Sigma^+$) → unstable H$_2$($^3\Sigma^+$) → H + H, can details of the process
be stated with certainty.

When alkali metals, or their salts, or, to a lesser extent, alkaline earth metals are
introduced into a flame, strong continuous emission around the blue or violet region
is obtained. Hartley (1894) reported observations of this type of continuum, which
has also been observed by the author when potassium or rubidium is introduced
into either a hydrogen or a carbon monoxide flame. The position of the maximum
of this continuum depends on the ionization potential of the alkali metal, moving
to longer wave-lengths for metals with lower ionization potential. This and other
types of continuum have been very fully reviewed and discussed by Finkelnburg
(1930, 1934, 1935), who has developed Stark’s suggestion that the process responsible
for this type of emission is the association of an electron and a positive ion to form a
neutral molecule. Finkelnburg has used the term ‘electron radiation’ to describe
this process and has shown that it is quantitatively of the right magnitude to account
for the strength of the continuum.

All flames show strong ionization, and, even when there is no evidence to show 
that the main constituents (e.g. H₂ or O₂) are ionized, it is possible that small
traces of impurity may result in enough ionization to give some continuous emission 
of the ‘electron radiation’ type as a background to the main features of the 
spectrum. 

Continua are also emitted when halogens are introduced into flames. Urey &
Bates (1929) have examined these and concluded that they result from the reunion
of an excited halogen atom and a normal atom to form a normal halogen molecule.
Several other inorganic flames, such as those of phosphorus, arsenic and selenium,
also give strong continuous emission (for references see Gaydon 1942a), although
the cause of the emission is unknown.

The spectrum of the bright blue carbon monoxide flame appears under small 
dispersion to be continuous. More detailed examination (Weston 1925; Gaydon
1940) reveals a very complex band structure. This is particularly well developed in
the flame at reduced pressure and also in the cool flame (Gaydon 1943). In hot flames
at high pressure the structure is less outstanding, and the appearance differs little 
from that of a pure continuum. There is no doubt that the higher temperature 
increases the length of the branches of the rotational fine structure of the band 
system and so contributes to this change of appearance. Whether or not there is a 
genuine continuous emission as well as the complex band spectrum is uncertain. 
The maximum temperature for the CO-O₂ flame is, at least in theory, relatively
high, and appreciable ionization and dissociation, both factors favouring the 
emission of continua, will result from this high temperature. 

The continuous emission from luminous organic flames, which is due to 
thermal emission from hot carbon particles, falls outside the scope of this 
investigation. 



Continuous spectra in flames

CONTINUOUS EMISSION BY THE HYDROGEN FLAME

The simple flame of hydrogen burning in air is practically non-luminous. The
spectrum shows the OH bands in the ultra-violet and the vibration-rotation
spectrum of H₂O in the near infra-red. The flame normally appears faintly orange, this
coloration being due to traces of impurities, chiefly Na and CaO. Higher-temperature
flames of hydrogen do, however, show appreciable luminosity. Finkelnburg
(1930) gives references to many old papers, some of which are not available to
the author, as copies have been removed to safety for the duration of the war.
Dibbits in 1864 found that hydrogen burning with air and other gases showed a
weak continuum with maximum in the green. Frankland in 1868 observed that the
spectrum of hydrogen burning in oxygen at ten atmospheres pressure was extremely
bright and entirely continuous in the visible region, while Liveing & Dewar (1891)
studied the rate of increase of the brightness of this continuum with pressure up
to 40 atm.

The present experiments indicate that high-temperature flames of hydrogen 
burning with pure oxygen are blue, and that the pale yellowish green coloration 
only occurs in the presence of nitrogen. The blue luminosity is particularly well 
marked with a simple flame of oxygen burning in hydrogen at a quartz jet. This 
flame has a bright blue base, and is sometimes surrounded by a red outer mantle 
the colour of which is due to emission of the vibration-rotation bands of H₂O (Gaydon
1942b) which extend from the infra-red right up to the visible. A group of spectra
of the blue part of this flame of oxygen burning in hydrogen is shown in the plate. 
There is strong emission throughout the blue, violet and near ultra-violet. The OH 
bands are very strong in this source and the very long branches of the rotational 
fine structure can be observed well down into the visible region, and there is some 
difficulty in distinguishing between true continuous emission and apparent
continuum resulting from the unresolved fine structure of these OH bands. The blue
colour is probably largely due to continuous emission, and this may be of the 
‘electron radiation’ type. The possibility of association processes between free 
atoms present in these high-temperature flames cannot, however, be ruled out and 
may well contribute, along with other factors, to the blue colour. 

The oxy-hydrogen blow-pipe flame burning in air has a blue core, but the outer 
part of the flame, especially the tip, shows a pale yellow or yellow-green luminosity.
The colour is rather difficult to describe, and in this type of flame is usually confused
by admixture with orange coloration due to the presence of Na and CaO as
impurities; the true colour is probably best described as an unsaturated yellow-green,
although it has on occasion been described as greyish green or brownish green or 
just as yellowish. It will be referred to as yellow-green throughout this paper. The 
same colour is produced when a trace of air or nitrogen is admitted to a flame of 
hydrogen burning in oxygen. Flames of hydrogen burning with nitrous oxide under 
suitable conditions show the yellow-green colour strongly. It was noticed that the 
oxy-hydrogen blow-pipe in air always smelt of oxides of nitrogen when the
yellow-green coloration was well developed, and it seemed that there must be some
connexion between the emission of this yellow-green continuum and some reaction
involving an oxide of nitrogen. The colour is very similar to that of the afterglow 
of air in an electric discharge at reduced pressure. The author has therefore carried 
out a number of experiments to compare the yellow-green continuum obtained by 
deliberately introducing oxides of nitrogen into flames with that emitted by the air
afterglow.

Experimental 

The spectrograph employed was a large-aperture (f/4) glass prism instrument.
This gave a visible spectrum about 5 cm. long; the moderate dispersion in the
visible region, combined with high speed, made the instrument particularly
suitable for the study of the yellow-green continuum emitted by the relatively weak
flame sources.

The photographic plate characteristics desired were uniform sensitivity
throughout the visible region combined with fairly high contrast and speed. Several makes
and varieties of plates were tried, but no really satisfactory type was found. All the
panchromatic or ‘long-range’ types showed low or irregular sensitivity in the green,
while orthochromatic types failed, of course, at the red end. Ilford Rapid Process
Panchromatic plates were actually used for all the investigations, although these show a
double maximum of sensitivity in the red and not very high sensitivity in the green,
as may be seen in the group of spectra of a luminous Bunsen flame in plate 2g.

Refined measurements of the continuous spectra emitted by flames appear to be 
of little value owing to the variations introduced by temperature and other factors. 
It was found sufficient for the present purpose to take a series of five exposures (by
racking down the plate holder) for each source and to compare these visually.
Exposures in the ratio $1 : \sqrt{3} : 3 : 3\sqrt{3} : 9$ were found suitable. A neon lamp
comparison spectrum, showing lines of Ne, He and H, was used to give an approximate
wave-length scale.
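
The five exposures form a geometric series with common ratio √3, so that every second step trebles the exposure. As an illustration only, the short Python sketch below generates such a series. The function name `exposure_series` and its arguments are hypothetical, and the starting value of 10 sec. is taken from the 10-90 sec. range quoted for the CO flame in the description of the plate.

```python
import math

def exposure_series(shortest, steps=5, ratio=math.sqrt(3)):
    """Return `steps` exposure times in geometric progression.

    `shortest` is the first exposure; each later one is `ratio` times
    the one before it (ratio = sqrt(3) gives 1 : sqrt(3) : 3 : 3*sqrt(3) : 9).
    """
    return [shortest * ratio ** k for k in range(steps)]

if __name__ == "__main__":
    # Illustrative case: shortest exposure 10 sec., as for the CO flame.
    for t in exposure_series(10.0):
        print(f"{t:6.1f} sec.")
```

A series starting at 10 sec. therefore ends at 90 sec., in agreement with the range given for that group of spectra.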

Nitric oxide was prepared by dropping 50 % sulphuric acid on to a concentrated 
solution of sodium nitrite and potassium iodide, the gas being stored over water 
and used undried except where otherwise stated (i.e. for CO flames). Other gases 
were commercial products. 

The flames studied were of three types: flames of premixed gases burning at a
quartz jet, flames of one gas burning in an atmosphere of another (for a brief
description of the apparatus see Gaydon 1942b), and flames of premixed gases burning at a
Smithell’s separator. This separator consisted of two concentric quartz or pyrex
tubes of about 1 and 1.5 cm. bore, a cork or similar fitting enabling the smaller tube to
be slid gently down until its top was about 5 cm. below that of the outer tube. By
adjustment of gas flow and mixture strength it thus became possible to separate
the inner cone of hydrocarbon flames on to the top of the smaller tube, leaving the
outer cone burning at the top of the larger tube.




The simple discharge tube used for the examination of the air or oxygen afterglow
is shown diagrammatically in figure 1. It consisted of two pyrex tubes A and B,
each 30 cm. long and 1 cm. bore, with viewing windows of quartz, W₁, W₂ and W₃, and
steel electrodes E₁ and E₂. Air or oxygen was admitted through a very fine capillary
at the tap T₁ and was evacuated through tap T₃. Nitric oxide could be admitted at
T₂ when desired. The discharge in the tube A was maintained by an induction coil.
It was found that stray weak discharges to the pump were reduced by connecting
the electrode E₂ and the pump so that they were at the same potential. The afterglow
in the tube B was viewed end-on through the window W₂.


Figure 1. Discharge tube for afterglow.


The yellow-green continuum in the afterglow and in flames 

The yellow-green air, or so-called oxygen, afterglow, obtained when a discharge
is passed through air or through oxygen containing a trace of nitrogen at a pressure
of the order of 1 mm. of mercury, has been studied by a number of investigators
(Rayleigh 1910; Stoddart 1934; Newman 1935), and Spealman & Rodebush (1935)
have convincingly shown that the luminosity is due to a reaction between nitric
oxide and atomic oxygen, this explanation fitting in well with the careful
observations made earlier by Rayleigh. The spectrum is mainly continuous, extending
throughout the visible region with a maximum somewhere in the green. Faint
banded structure has also been observed, but it is very weak and its features are
difficult to measure.

The spectrum of the afterglow of air and that of oxygen containing a little nitrogen
have been photographed under conditions similar to those of the flame spectra. It has also been
found that, when a very slow stream of nitric oxide is admitted at tap T₂ (figure 1)
and fairly pure oxygen is used in the discharge, a bright glow is obtained at the point
where the gases mix. A group of spectra of this glow, due to the reaction of atomic
oxygen formed in the discharge with the nitric oxide, is shown in plate 2f. The
spectrum is continuous and of maximum intensity in the green; the double maxima
in the red are due to plate sensitivity being high in these regions. There is some very 
faint banded structure, but even with the high contrast of the process plates this is 
not easy to see. 

It has been remarked that some flames containing oxides of nitrogen show a 
yellow-green continuous emission. Plate 2e shows a group of spectra of the flame
of nitrous oxide burning in hydrogen, which is of this type. It is also found that when
nitric oxide is introduced into some, but not all, flames a strong yellow-green
continuous emission results. This is true for carbon monoxide burning in oxygen
(plate 2d) and, although rather less strongly, for hydrogen burning in oxygen
(plate 2c).

The colours of the air afterglow and of these flames containing nitric oxide are
strikingly similar, and this is borne out by comparison of the spectra. The similarity
of the colour of the afterglow to that obtained by introducing NO into a Bunsen
flame was first pointed out by Rayleigh (1911). There seems no reason to doubt
that the process responsible for the emission of these yellow-green continua is the 
same, namely, the reaction between nitric oxide and atomic oxygen. The cause 
of the yellow-green colour of flames containing oxides of nitrogen has thus been 
ascertained. 

The absorption spectrum of the molecule NO₂ shows a banded structure in the
visible region, indicating that absorption in this region results in the formation of an
electronically excited stable NO₂ molecule. Beyond about 3700 Å the band structure
merges into a continuum, indicating the process

$\mathrm{NO_2} + h\nu \rightarrow \mathrm{NO} + \mathrm{O}$.

There is thus the possibility that a collision between nitric oxide and atomic oxygen
will result in the reverse process, the emission of a light quantum and the formation
of a normal NO₂ molecule. This process would result, in the absence of appreciable
thermal energy in the colliding particles, in a continuous emission with short-wave
limit around 3700 Å. Assuming an electron transition probability of the order 10⁷
or 10⁸ per sec. and a duration of the order 10⁻¹² or 10⁻¹³ sec. for a collision, we obtain
a probability of about the order 10⁻⁵ for the reaction

$\mathrm{NO} + \mathrm{O} = \mathrm{NO_2} + h\nu$

to occur as a bimolecular association. This seems to be of the right magnitude. In
the afterglow each oxygen atom will make about 10⁷ collisions per sec., and, as only
a small proportion of these collisions will be with nitric oxide molecules, this gives
a reasonable duration for the afterglow, which in practice is usually of the order
of a second, although much less if nitric oxide is admitted.
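
The order-of-magnitude estimate in this paragraph is the product of a transition probability per second and the duration of a collision. The Python sketch below repeats that arithmetic and, as a further illustration, estimates the mean life of an oxygen atom in the afterglow. The function names are hypothetical, and the fraction of collisions made with nitric oxide (1 in 100) is an assumed value chosen only for illustration; it is not a measured quantity.

```python
def association_probability(transition_rate, collision_time):
    """Chance that a quantum is emitted during a single collision.

    transition_rate: electronic transition probability, per sec.
    collision_time:  duration of one collision, sec.
    """
    return transition_rate * collision_time

def afterglow_lifetime(collisions_per_sec, no_fraction, probability):
    """Mean life (sec.) of an oxygen atom removed only by NO + O = NO2 + hv.

    no_fraction is the assumed fraction of collisions made with NO.
    """
    return 1.0 / (collisions_per_sec * no_fraction * probability)

if __name__ == "__main__":
    # The four combinations of the figures quoted in the text.
    for rate in (1e7, 1e8):
        for duration in (1e-13, 1e-12):
            p = association_probability(rate, duration)
            print(f"rate {rate:.0e} per sec., duration {duration:.0e} sec.: p = {p:.0e}")

    # Assumption for illustration: 1 collision in 100 is with nitric oxide.
    print(afterglow_lifetime(1e7, 0.01, 1e-5), "sec.")
```

With the quoted figures the product lies between 10⁻⁶ and 10⁻⁴, centring on 10⁻⁵, and with the assumed nitric oxide fraction the estimated life is of the order of a second.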

There is also a possibility of a triple collision between nitric oxide, an oxygen atom
and a third body. In this case the third body might take up the rather large amount
of energy set free by the formation of NO₂, or alternatively it might take up only a
small part of this energy, leaving the excess as electronic energy of excitation of the
newly formed NO₂ molecule. In this latter case the NO₂ molecule might then emit
a banded spectrum. This would account for the faint banded emission superposed
on the yellow-green continuum of the afterglow spectrum. 
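
The energy set free when normal NO₂ is formed from NO and O can be gauged from the short-wave limit of the continuum, since a quantum at about 3700 Å carries the whole of it when the colliding particles have little thermal energy. The Python sketch below converts a wave-length to an energy per mole. The function name is hypothetical and the physical constants are modern values, not figures taken from this paper.

```python
PLANCK = 6.62607015e-34        # Planck constant, J s (modern value)
LIGHT_SPEED = 2.99792458e8     # velocity of light, m per sec.
AVOGADRO = 6.02214076e23       # molecules per mole
JOULES_PER_KCAL = 4184.0       # thermochemical kilocalorie

def photon_energy_kcal_per_mole(wavelength_angstrom):
    """Energy of one mole of quanta of the given wave-length, in kcal."""
    wavelength_m = wavelength_angstrom * 1e-10
    joules_per_quantum = PLANCK * LIGHT_SPEED / wavelength_m
    return joules_per_quantum * AVOGADRO / JOULES_PER_KCAL

if __name__ == "__main__":
    # Short-wave limit of the continuum quoted in the text.
    print(f"{photon_energy_kcal_per_mole(3700.0):.1f} kcal. per mole")
```

For 3700 Å this gives roughly 77 kcal. per mole, which indicates the amount of energy a third body would have to take up if no radiation were emitted.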

The majority of ordinary flames in air do not contain appreciable quantities of 
nitric oxide so that even if atomic oxygen is present the yellow-green continuum 
will not be emitted. For the very hot oxy-hydrogen flame, however, small quantities 
of nitric oxide may be formed if air is allowed to mix in with the flame, and so the flame 
shows the faintly luminous tip which has been commented on. Similarly, nitrous 
oxide usually breaks up into oxygen and molecular nitrogen rather than to oxides 
of nitrogen, and so the yellow-green continuum is not necessarily emitted even if 
atomic oxygen is present; under certain flame conditions, such as nitrous oxide 
burning in hydrogen, however, it seems that both nitric oxide and atomic oxygen 
are present, and so the flame shows the continuous emission; all flames of this type 
smell strongly of oxides of nitrogen. 

Nitric oxide in flames

The assignment of the yellow-green continuum to a reaction between nitric oxide 
and atomic oxygen gives us a simple qualitative test for the presence of atomic 
oxygen in a flame by introducing a little nitric oxide and observing whether or not 
the yellow-green continuous emission is then obtained. In this section the
experimental results obtained by subjecting a number of flames to this test are described;
application of the results to a discussion of the combustion mechanism is deferred 
to later sections. 

Carbon monoxide. Flames, of all types, of carbon monoxide burning with air or
oxygen are bright blue in colour. In all cases the addition of a little nitric oxide
produces a marked change to a whitish yellow-green. The effect is particularly strong
when the NO is added to a flame of CO burning in O₂. Groups of spectra illustrating
this are shown in plate 2a and d. It may be noted that the addition of a little
nitrous oxide instead of nitric oxide does not produce the same effect.

The original experiments on CO flames were carried out with undried gases. In
view of the importance of moisture in the combustion of CO, a flame of CO + NO in
O₂ in which the gases were carefully dried was also examined. The CO and O₂ were
dried by slow passage through a liquid oxygen trap, and the NO was passed first
through sulphuric acid and then through three U tubes packed with phosphorus
pentoxide. The yellow-green colour was again produced with about the same intensity.
These experiments cannot be compared with those on the combustion of intensively 
dried carbon monoxide, but they serve to show that moderate drying does not 
materially affect the concentration of atomic oxygen in the flame, which remains high. 

Flames of carbon monoxide burning with nitrous oxide are bluish in colour and 
show a rather similar spectrum to that of flames with oxygen, but the blue shade is 
somewhat less intense, and this has already been commented on (Gaydon 1942b).
This departure from the bright blue colour is particularly noticeable for the
premixed CO-N₂O flame, and indicates some slight nitric oxide formation, the reaction
of which with atomic oxygen leads to yellow-green emission which dulls the
brightness of the blue colour of the flame. The addition of a very little nitric oxide to these
flames supported by nitrous oxide produces a very marked intensification of the 
yellow-green colour. 

Hydrogen. When a little nitric oxide is added to a flame of hydrogen burning in
oxygen a definite, although not strong, coloration results. The colour is rather
greyish green, the departure from the yellow-green shade being apparently the
result of admixture with the natural blue shade of the untreated flame. With the
flame of oxygen burning in hydrogen, which is quite a bright blue, the effect of
adding nitric oxide is less marked. When the NO is admitted in the H₂ stream there
does not appear to be any appreciable effect on the colour of the flame, which,
however, enlarges slightly. When the NO is admitted in the O₂ stream there does
appear to be a slight greenish coloration, but it is not very marked. In either case,
however, there is a definite effect on the outer flame of the excess hydrogen burning in
air, which becomes grey-green and shows the continuous emission spectrum. Groups
of spectra of hydrogen burning in oxygen and the same with a little nitric oxide
added are shown in plate 2b and c.

Flames of hydrogen burning with nitrous oxide are of rather varied type (Dixon
& Higgins 1926). In some cases the spectrum shows NO, NH and the ammonia α
bands (Fowler & Badami 1931) and in other cases, such as N₂O burning in H₂ or
the premixed flame containing excess of N₂O, the yellow-green continuum is strong
and dominates the spectrum.

Methane. Flames of methane in oxygen or air or vice versa are luminous (carbon
deposition) and are not suitable for the nitric oxide test. The premixed CH₄-air
flame, however, gives striking results. With a simple premixed flame of the Bunsen
type it was observed that fairly strong yellowish green coloration was produced by
the addition of nitric oxide, and that this colour was apparently restricted to the
outer zone of the flame. This was confirmed by study of the separated cones in a
Smithell’s separator.

The methane-air flame in the Smithell’s separator shows a bright blue-green inner
cone and a very pale blue-violet outer cone. On admitting nitric oxide the inner cone 
turns bright violet and the outer cone shows the yellow-green colour strongly. This 
is a very simple and effective experiment. 

The spectrum of the inner cone, before the admission of nitric oxide, shows the
usual C₂ Swan and CH bands. When nitric oxide is present the CN violet system
comes up strongly, this accounting for the violet colour, and the C₂ Swan bands are
partly suppressed; the CH bands do not appear to be affected (see plate 2h). With
a steadily burning flame there is not the slightest trace of the yellow-green
continuous emission in the inner cone. Only with an unsteady flame, or with excess
air, does a slight evanescent yellow-green colour appear above and around the inner
of the separated cones, and this is clearly due to incomplete separation of the two
types of combustion occurring in the two cones.




Ethylene. The results with ethylene are similar to those with methane, but the
yellow-green coloration of the outer cone is rather less strong unless a fairly large
amount of nitric oxide is present. This may be due to the partial decomposition of 
the nitric oxide in this very hot flame. 

Butane. The results for butane are also similar to those for methane. In the
Smithell’s separator the inner cone changed from blue-green to violet when nitric
oxide was added, while the outer cone changed from pale blue-violet to bright
yellow-green.

The combustion of carbon monoxide; the role of atomic oxygen

The experimental results detailed above serve as a qualitative test for the presence 
of atomic oxygen in the various flames studied. For the carbon monoxide flame it is 
clear that a considerable concentration of atomic oxygen exists, both for the dry 
and moist gases. Previously the author (Gaydon 1942a) has been sceptical of
combustion mechanisms involving atomic oxygen, but it now seems that this attitude
must be modified in discussing the carbon monoxide flame as there can be little 
doubt that the high concentration of atomic oxygen is the result of the combustion 
processes and is not merely thermal in origin. It should be remembered that the 
concentration of atomic oxygen will depend on the processes consuming the atoms 
as well as on those responsible for their liberation. 

The several proposed combustion mechanisms for moist and dry CO have been
dealt with in books by Hinshelwood (1933), Semenoff (1935) and Lewis & von Elbe
(1938). For the dry reaction it is very difficult to devise a mechanism involving only
CO, O₂, O and CO₂ which will account satisfactorily for the chemical kinetics. Lewis
& von Elbe have therefore supposed that ozone and a hypothetical carbon peroxide
CO₃ may take part in the reactions. Of the reactions included as likely by them, the
following lead to the formation of atomic oxygen:

$\mathrm{O_3 + CO = CO_2 + O + O} - 15\ \mathrm{kcal}$,   (1)

$\mathrm{O_3 + CO = CO_3 + O}$,   (2)

and the following to its removal:

$\mathrm{O + O_2} + M = \mathrm{O_3} + M$,   (3)

$\mathrm{O + CO} + M = \mathrm{CO_2} + M$.   (4)

They rightly stress that at room temperature reaction (4) is at least 500 times slower 
than (3) and is therefore probably unimportant as a chain-breaking process. 

The flame spectrum of CO shows a banded spectrum on a continuous background.
This banded spectrum is believed (Gaydon 1940) to be emitted by CO₂. Kondratjewa
& Kondratjew (1937) have shown that under favourable conditions at least one
quantum of light is emitted by the flame for every 125 normal CO₂ molecules formed.
This is vastly too much radiation to result from thermal causes, and in any case
there is no evidence that this spectrum can be obtained from CO₂ molecules by either
thermal or electrical excitation. Of the processes retained by Lewis & von Elbe,
(4) is the only one liberating sufficient energy for the formation of electronically
activated CO₂ molecules:

$\mathrm{CO + O} + M = \mathrm{CO_2^*} + M$.   (4a)

This process is probably slightly endothermic. It might be followed by

$\mathrm{CO_2^*} = \mathrm{CO_2} + h\nu$ (banded),   (4b)

by

$\mathrm{CO_2^* + O_2 = CO_2 + O + O}$,   (4c)

which is probably slightly exothermic, or by reactions involving the dissociation
of the CO₂* or another CO₂ molecule.

It has already been stressed by the author that there is no evidence that the
electronically excited CO₂ molecules are metastable. Their life is probably between
10⁻⁸ and 10⁻⁵ sec. This would, however, allow for a fair number of collisions with
oxygen molecules and, although sufficient radiation by process (4b) must take place
to give the observed band spectrum, the majority of the excited molecules will
probably end in reaction (4c). Thus reaction (4) should be regarded as chain-branching
and not chain-breaking.
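
The statement that an excited molecule suffers a fair number of collisions during its life can be checked roughly as follows. The collision frequency at flame pressure is not given in the text; the Python sketch below assumes that the figure of about 10⁷ collisions per sec. quoted earlier for the afterglow, at a pressure of the order of 1 mm. of mercury, scales in proportion to pressure, and it neglects the effect of temperature. The function names and the choice of one atmosphere for the flame are likewise assumptions made for illustration.

```python
AFTERGLOW_COLLISIONS_PER_SEC = 1e7   # figure quoted for the afterglow
AFTERGLOW_PRESSURE_MM = 1.0          # order of 1 mm. of mercury
FLAME_PRESSURE_MM = 760.0            # assumption: flame at one atmosphere

def collision_frequency(pressure_mm):
    """Collisions per sec., scaled linearly with pressure (an assumption)."""
    return AFTERGLOW_COLLISIONS_PER_SEC * pressure_mm / AFTERGLOW_PRESSURE_MM

def collisions_during_life(lifetime_sec, pressure_mm=FLAME_PRESSURE_MM):
    """Number of collisions made by a molecule within its radiative life."""
    return collision_frequency(pressure_mm) * lifetime_sec

if __name__ == "__main__":
    # Limits of the probable life of the excited CO2 molecule.
    for life in (1e-8, 1e-5):
        n = collisions_during_life(life)
        print(f"life {life:.0e} sec.: about {n:.0f} collisions")
```

On these assumptions the number of collisions ranges from under a hundred to several tens of thousands, only a part of which will be with oxygen molecules.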

It is usual, in discussing combustion mechanisms, to disregard direct bimolecular
associations because of the difficulty of removing the excess energy from the system.
For the special case of the carbon monoxide flame, however, it is clear that the
bimolecular association

$\mathrm{O + NO = NO_2} + h\nu$,

resulting in the yellow-green continuous emission, is able to compete for atomic
oxygen with the combustion processes, and therefore there seems no reason why
the similar association

$\mathrm{O + CO = CO_2} + h\nu$ (continuous)   (5)

may not also have some probability. The reaction (5) probably requires some
activation energy for a favourable collision accompanied by radiation, and the
probability will in any case be less than 10⁻⁵ at each collision, but nevertheless the
process may account for some of the continuous background to the flame spectrum.
The existence of an activation energy for this reaction fits in with the observation
that the continuous background is weaker, relative to the banded spectrum, for
flames at low pressure and temperature and especially for the cool flame (Gaydon
1943).

It has been stressed (Gaydon 1942a) that normal carbon dioxide, CO₂ (X ¹Σ),
cannot be formed from normal CO (X ¹Σ) and oxygen, O (³P), without electronic
rearrangement. Thus reaction (1) probably requires a very high activation energy in
addition to the 15 kcal. absorbed in the reaction, and may therefore be of little
importance. These considerations also apply to reaction (4) but not to (4a). Little
can be said about the hypothetical reaction (2). From a spectroscopic viewpoint it
seems that reactions (4a), (4b), (4c) and (5) are required to account for the spectrum
and the presence of atomic oxygen in the dry flame.




It should, however, be stressed that perfectly dry CO will not readily burn at all
(Ubbelohde 1933), and that although the reactions listed above may serve to
maintain combustion they do not account for its initiation. It is likely that for reaction
(4a) to take place the third body M may have to fulfil special conditions and that
water is particularly suitable.

For the moist reaction the concentration of atomic oxygen in the flame is also
high. It seems likely, from the present study, that there is no essential difference in
the combustion mechanism in the presence of a trace of water, but that its thermal
decomposition may serve to initiate the reaction chains, and that by collisions the
water molecules may quickly set free the vibrational energy of the newly formed
CO₂ molecules so that this energy can thermally assist the further reaction instead
of merely producing abnormal dissociation of the CO₂ in the flame products.
However, the reaction chain

$\mathrm{OH + CO = CO_2 + H} + 27\ \mathrm{kcal}$,   $\mathrm{H + O_2 = OH + O} - 18\ \mathrm{kcal}$,

which could be initiated by thermal decomposition of H₂O to OH and H, would
also lead to accumulation of atomic oxygen and may be important.
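
Adding the two steps of this chain shows why it leads to accumulation of atomic oxygen: OH and H cancel, and each cycle converts one CO and one O₂ into CO₂ and a free oxygen atom. The Python sketch below performs this bookkeeping. The function name `net_reaction` and the data layout are hypothetical, and the heats of reaction are those given with the chain in the text.

```python
from collections import Counter

# Each step: (reactants, products, heat set free in kcal; negative = absorbed).
CHAIN = [
    (["OH", "CO"], ["CO2", "H"], 27.0),
    (["H", "O2"], ["OH", "O"], -18.0),
]

def net_reaction(steps):
    """Sum a sequence of steps, cancelling species formed and consumed."""
    balance = Counter()
    heat = 0.0
    for reactants, products, heat_kcal in steps:
        for species in reactants:
            balance[species] -= 1
        for species in products:
            balance[species] += 1
        heat += heat_kcal
    # Species with a negative balance are consumed overall, positive are formed.
    consumed = sorted(s for s, n in balance.items() if n < 0 for _ in range(-n))
    formed = sorted(s for s, n in balance.items() if n > 0 for _ in range(n))
    return consumed, formed, heat

if __name__ == "__main__":
    consumed, formed, heat = net_reaction(CHAIN)
    print(" + ".join(consumed), "=", " + ".join(formed), f"{heat:+.0f} kcal.")
```

The net result is CO + O₂ = CO₂ + O, with about 9 kcal. set free per cycle on the figures quoted.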

The hydrogen flame 

For flames in which oxygen is in excess, i.e. for hydrogen burning in oxygen or 
air, there is a moderate concentration of atomic oxygen, although not as high as 
that in the CO flame. When hydrogen is in excess, i.e. oxygen burning in hydrogen, 
the concentration of atomic oxygen appears to be small. 

The results do not appear to be sufficiently definite to draw any certain
conclusions. It seems that some atomic oxygen may be formed by the reaction

$\mathrm{H + O_2 = OH + O} - 18\ \mathrm{kcal}$,   (6)

and that the removal of this atomic oxygen in the presence of an atmosphere of 
hydrogen may be more rapid than in an atmosphere of oxygen, but in the absence 
of quantitative data it cannot be said whether this reaction is of major importance 
in the combustion mechanism for hydrogen. 


The mechanism of hydrocarbon combustion 

The experiments on methane, ethylene and butane in the Smithell’s separator
show clearly that in the outer cone, corresponding to the later stages of the
combustion when hydrogen and carbon monoxide are the principal combustibles, there
is a high concentration of atomic oxygen, but that in the inner cone, corresponding
to the early stages of the combustion of the hydrocarbons, atomic oxygen is not
present in sufficient quantity to give the slightest coloration with the nitric oxide
test. It is known that for methane at room temperature there is practically no
reaction with atomic oxygen and that methane does not show an ‘atomic flame’
with oxygen. It therefore seems fairly certain that the negligible concentration of
atomic oxygen in the early stages of the combustion must be due to its not being 
formed rather than to its very rapid removal. This receives experimental support 
from the observation that a flame of mixed carbon monoxide and butane (with a 
fairly high proportion of CO and air) burning in a Smithell’s separator does show the 
yellow-green coloration in the inner cone, indicating presumably that the butane 
is not particularly effective in removing the atomic oxygen. 

Two main types of mechanism, and a number of variants on these, have been 
suggested to explain the initial stages of hydrocarbon oxidation. These are usually 
referred to as the hydroxylation and peroxidation theories. The hydroxylation 
theory, put forward by the late Professor W. A. Bone, assumes the successive 
introduction of hydroxyl groups into the hydrocarbon molecule, and its most
promising modification, that proposed by Norrish, makes use, for methane, of the
chain process

$\mathrm{O + CH_4 = CH_4O = CH_2 + H_2O}$,   $\mathrm{CH_2 + O_2 = H_2CO + O}$.

According to the peroxidation theory as suggested by Engler and by Bach and
applied to combustion processes by Callendar and by Sir Alfred Egerton and others,
reaction occurs when a sufficiently energetic oxygen molecule combines
momentarily with an energetic hydrocarbon molecule to form a temporary peroxide which
subsequently breaks down to aldehyde and water, setting free energy to maintain
the combustion. Variants on this theory proposed by Ubbelohde and by von Elbe &
Lewis make use of reaction chains involving, for methane, the radical CH₃ and
either another peroxidic radical or OH. These combustion mechanisms have been
discussed elsewhere (Gaydon 1942a), and it has been shown that certain
spectroscopic observations give some support for a peroxidation rather than a
hydroxylation theory.

These new experiments, showing a negligible amount of atomic oxygen in the
inner cone, appear to provide evidence against Norrish’s modification of the
hydroxylation theory which requires atomic oxygen for the chain reactions.

It is a pleasure to express my thanks to Sir Alfred Egerton for his close interest 
in the work and helpful discussions. I am also indebted to the Council of the Royal 
Society for generous financial assistance. 


References 

Dixon, H. B. & Higgins, W. F. 1926 Mem. Manchr Lit. Phil. Soc. 71, 15. 
Finkelnburg, W. 1930 Phys. Z. 31, 1. 
Finkelnburg, W. 1934 Phys. Rev. 45, 341; 46, 330. 
Finkelnburg, W. 1935 Phys. 93, 201. 
Fowler, A. & Badami, J. S. 1931 Proc. Roy. Soc. A, 133, 325. 
Gaydon, A. G. 1940 Proc. Roy. Soc. A, 176, 505. 
Gaydon, A. G. 1942a Spectroscopy and combustion theory. London: Chapman and Hall. 
Gaydon, A. G. 1942b Proc. Roy. Soc. A, 181, 197. 
Gaydon, A. G. 1943 Proc. Roy. Soc. A, 182, 199. 




Hartley, W. N. 1894 Phil. Trans. A, 185, 161. 
Hinshelwood, C. N. 1933 Kinetics of chemical change in gaseous systems. Oxford. 
Kondratjewa, E. & Kondratjew, V. 1937 J. Phys. Chem. Russ. 9, 736. 
Lewis, B. & von Elbe, G. 1938 Combustion, flames and explosions of gases. Cambridge. 
Liveing, G. D. & Dewar, J. 1891 Proc. Roy. Soc. A, 49, 217. 
Newman, F. H. 1935 Phil. Mag. 20, 777. 
Rayleigh, Lord (R. J. Strutt) 1910 Proc. Phys. Soc. 23, 66. 
Rayleigh, Lord (R. J. Strutt) 1911 Proc. Roy. Soc. A, 86, 56. 
Semenoff, N. 1935 Chemical kinetics and chain reactions. Oxford. 
Spealman, M. L. & Rodebush, W. H. 1935 J. Amer. Chem. Soc. 57, 1474. 
Stoddart, E. M. 1934 Proc. Roy. Soc. A, 147, 454. 
Ubbelohde, A. R. 1933 J. Chem. Soc. p. 972. 
Urey, H. C. & Bates, J. R. 1939 Phys. Rev. 34, 1541. 
Weston, F. R. 1935 Proc. Roy. Soc. A, 109, 177, 523. 


For description of plate see page 124. 





Description of Plate 

Groups (a)-(g) each show at the top the spectrum of a neon glow lamp for approximate 
wavelength scale, and below five spectra of the various sources studied with exposures 
increasing in the ratio 1 : √3 : 3 : 3√3 : 9. Ilford Process Panchromatic plates were used, 
and conditions of development, etc., were maintained as constant as possible throughout. 

(a) Flame of CO burning in O₂. Exposures 10-90 sec. 

(b) Flame of O₂ burning in H₂. Exposures 2-18 min. 

(c) Flame of H₂ mixed with NO burning in O₂. Exposures ½-4½ min. 

(d) Flame of CO mixed with NO burning in O₂. Exposures 10-90 sec. 

(e) Flame of N₂O burning in H₂. Exposures 1-9 min. 

(f) Glow produced by mixing nitric oxide with afterglowing oxygen from electric 
discharge. Exposures 6⅔-60 min. 

(g) Luminous Bunsen flame. Exposures 5-45 sec. This serves to illustrate photographic 
plate sensitivity. 

(h) Upper spectrum; inner cone of flame of premixed methane and air in Smithells’ 
separator. Exposure 10 min. Lower spectrum; the same with nitric oxide added. 















Relaxation methods applied to engineering problems 
X. The graphical representation of stress 

By D. N. de G. Allen, B.A. and R. V. Southwell, F.R.S. 

(Received 27 January 1944) 


A convenient representation of a vector quantity (e.g. the shear stress on a specified plane) 
is by contours which exhibit its resultant magnitude and by ‘trajectories’ which have its 
direction at every point. But it sometimes happens that only its components in two 
perpendicular directions are known, and then, while the contours are easy to construct, 
derivation of the ‘trajectories’ presents a special problem. If this can be solved, tensor 
quantities also will be representable, e.g. a state of plane stress. 

By orthodox methods the problem would appear to be difficult, but in this paper it is 
shown to yield to ‘relaxational’ attack. 


Introduction 

1. This paper deals with a class of problem having importance in various branches 
of mathematical physics, and special importance in the theory of elasticity. Its 
worked examples all relate to the representation of stress, and its title has been 
chosen accordingly; but it has much wider application. 


Graphical representation of a vector quantity 

2. A vector quantity has magnitude and direction, and therefore requires two 
quantities for its specification. For example, the shear stress on a plane $(x, y)$ can be 
specified by its components 

$$X_z = S\cos\theta, \qquad Y_z = S\sin\theta, \tag{1}$$

$S$ denoting the resultant shear intensity, $\theta$ the inclination of its direction to the 
axis $Ox$. If now it is desired to present this specification graphically, an obvious 
means of doing so will be (a) a diagram showing by contours the variation of $S$, 
(b) a family of ‘stress trajectories’ which, being everywhere inclined at $\theta$ to $Ox$, 
have the direction of the resultant shear. 

From (1) we have 

$$S^2 = X_z^2 + Y_z^2, \tag{2}$$

whence $S$ can be calculated when $X_z$, $Y_z$ are known (e.g. from experimental 
measurements); so construction of the contours (a) presents no difficulty. To construct the 
stress trajectories (b), the simplest procedure is to regard them as contours of some 
function $\beta$ not yet determined. Then 

$$\cos\theta\,\frac{\partial\beta}{\partial x} + \sin\theta\,\frac{\partial\beta}{\partial y} = 0, \tag{3}$$

since $\beta$ has zero gradient in the direction $\theta$; and combining (3) with (1) we see that 
(provided $S$ is nowhere infinite) 

$$X_z\frac{\partial\beta}{\partial x} + Y_z\frac{\partial\beta}{\partial y} = 0. \tag{4}$$

Vol. 183. A. (30 November 1944) 




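Our problem is to deduce $\beta$ from this equation, which implies that 

$$X_z = \alpha\frac{\partial\beta}{\partial y}, \qquad Y_z = -\alpha\frac{\partial\beta}{\partial x}, \tag{5}$$

$\alpha$ being some other function of $x$ and $y$, not yet determined. 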
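
As a hedged illustration of equations (1) and (2), the sketch below recovers the resultant intensity $S$ and the inclination $\theta$ from a pair of measured components. The function name `resultant_shear` and the sample values are hypothetical and do not come from the paper; the angle is returned in radians, and `atan2` is used so that the quadrant of $\theta$ is preserved.

```python
import math

def resultant_shear(xz, yz):
    """Return (S, theta) for components X_z = S cos(theta), Y_z = S sin(theta)."""
    s = math.hypot(xz, yz)        # equation (2): S^2 = X_z^2 + Y_z^2
    theta = math.atan2(yz, xz)    # inclination of the resultant to Ox, in radians
    return s, theta

# Illustrative values only (not taken from the paper).
s, theta = resultant_shear(3.0, 4.0)
print(s, math.degrees(theta))
```

Applied node by node, the first return value supplies the data for the contours (a), and the second gives the local direction which the trajectories (b) must follow.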


3. Usually, in the theory of elasticity, though $X_z$ and $Y_z$ are not known initially, 
expressions of the type of (5) can be assigned to them.* Our problem is the 
converse of this: $X_z$, $Y_z$ being given, the corresponding distributions of $\alpha$ and $\beta$ must 
be found in order that $\beta$-contours (the ‘stress trajectories’) may be constructed. 
The function $\alpha$ will be easy to deduce when $\beta$ has been found, since 

$$\alpha^2\left\{\left(\frac{\partial\beta}{\partial x}\right)^2 + \left(\frac{\partial\beta}{\partial y}\right)^2\right\} = X_z^2 + Y_z^2 = S^2, \tag{6}$$

by (5) and (2). 


Graphical representation of a tensor quantity 

4. In general stress is a tensor, not a vector quantity, having three components 
even in the two-dimensional case. In a state of ‘plane stress’, for example, 

$$Z_x = Z_y = Z_z = 0 \tag{7}$$

and the other stress components have the expressions 

$$X_x = \frac{\partial^2\chi}{\partial y^2}, \qquad Y_y = \frac{\partial^2\chi}{\partial x^2}, \qquad X_y = -\frac{\partial^2\chi}{\partial x\,\partial y} \tag{8}$$

in terms of a stress function $\chi$ which satisfies the ‘biharmonic equation’ 

$$\nabla^4\chi = 0. \tag{9}$$

More generally, i.e. when (7) are not satisfied, three further quantities are entailed. 
But $Z_z$ can be exhibited by contours, and the shear stress of which $Z_x$, $Z_y$ are 
components can be represented by contours and by ‘stress trajectories’ in the manner 
of §§ 2-3; so the only problem remaining is to represent, for all points in a specified 
plane $(x, y)$, a state of stress specified by its components $X_x$, $Y_y$, $X_y$ when these are 
not given by functional expressions of the type of (8). 


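* Thus the stresses induced by torsion in a rod of circular section but non-uniform diameter 
have the expressions 

$$\widehat{\theta z} = \frac{1}{r^2}\frac{\partial\phi}{\partial r}, \qquad \widehat{r\theta} = -\frac{1}{r^2}\frac{\partial\phi}{\partial z}, \tag{i}$$

in terms of a function $\phi$ which has a constant value on the generator of the stress-free boundary, 
and which satisfies the equation 

$$\frac{\partial^2\phi}{\partial r^2} - \frac{3}{r}\frac{\partial\phi}{\partial r} + \frac{\partial^2\phi}{\partial z^2} = 0$$

at every point in the material. These expressions become identical with (5) if, in the latter, 
$z$, $r$, $\theta$, $r^{-2}$, $\phi$ are substituted for $x$, $y$, $z$, $\alpha$, $\beta$. 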




Love (1927, § 57) cites a mode of presentation suggested by Michell (1901), but 
implies that a complete representation of stress is not practicable. It would seem, 
however, to present no special difficulty once the problem of vector representation 
(§§ 2-3) has been solved; for a stress defined by $X_x$, $Y_y$, $X_y$ will be completely 
represented by (a) a set of orthogonally intersecting trajectories directed along the planes 
of principal shear stress, together with (b) contours exhibiting the local intensity of 
the principal shear stress and (c) contours exhibiting the distribution of $(X_x + Y_y)$, 
a quantity which is invariant for change of axes. The intensity (b) of the principal 
shear stress is given by 

$$|S| = \tfrac12\sqrt{\{(X_x - Y_y)^2 + 4X_y^2\}}, \tag{10}$$

and (a) the planes of principal shear stress are inclined to $Ox$ at angles $\theta_1$, $\theta_2$ which 
are roots of the equation 

$$\cot 2\theta = -2X_y/(X_x - Y_y). \tag{11}$$

Thus all of $|S|$, $\theta_1$, $\theta_2$, $(X_x + Y_y)$ are calculable, and (b) and (c) present no difficulty. 

To plot the trajectories (a) we require to know the distribution of some function 
$\beta$ such that 

$$\cos\theta\,\frac{\partial\beta}{\partial x} + \sin\theta\,\frac{\partial\beta}{\partial y} = 0, \tag{3 bis}$$

when $\theta$ is identified either with $\theta_1$ or $\theta_2$. This problem has been considered already. 
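
The quantities (a), (b) and (c) of this section can be evaluated at each nodal point from the three stress components alone. The following is a minimal sketch under that reading: the names `principal_shear` and `shear_on_plane` are hypothetical, angles are in radians, and the rotation formula used in `shear_on_plane` is the standard plane-stress transformation, assumed here rather than quoted from the paper. Equation (11), $\cot 2\theta = -2X_y/(X_x - Y_y)$, is solved in the equivalent form $\tan 2\theta = -(X_x - Y_y)/(2X_y)$ so that the case $X_x = Y_y$ needs no special treatment.

```python
import math

def principal_shear(xx, yy, xy):
    """Return (|S|, theta1, theta2, stress_sum) from X_x, Y_y, X_y."""
    s = 0.5 * math.hypot(xx - yy, 2.0 * xy)           # equation (10)
    theta1 = 0.5 * math.atan2(-(xx - yy), 2.0 * xy)   # a root of equation (11)
    theta2 = theta1 + 0.5 * math.pi                   # the orthogonal family
    return s, theta1, theta2, xx + yy

def shear_on_plane(xx, yy, xy, theta):
    """Shear stress referred to axes turned through theta (standard rotation formula)."""
    return -0.5 * (xx - yy) * math.sin(2.0 * theta) + xy * math.cos(2.0 * theta)

# Illustrative values only: simple tension along Ox.
s, t1, t2, total = principal_shear(2.0, 0.0, 0.0)
print(s, math.degrees(t1), math.degrees(t2), total)
```

In this sketch the shear stress evaluated on the plane at `theta1` equals $|S|$ and that on the plane at `theta2` equals $-|S|$, which is one way of confirming that the roots of (11) have been identified correctly.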


Outline of the relaxational attack 

5. Graphical representation of stress, whether this be a vector or a tensor quantity, 
thus entails the finding of a function $\beta$ which satisfies an equation of the type of 

$$X_z\frac{\partial\beta}{\partial x} + Y_z\frac{\partial\beta}{\partial y} = 0, \tag{4 bis}$$

in which $X_z$, $Y_z$ are known. This is a purely mathematical problem, not yet attacked 
by Relaxation Methods. For orthodox (analytical) methods it would seem to be in 
general difficult, even when $X_z$, $Y_z$ are specified functions of $x$ and $y$; and insoluble 
when, as will often happen, their functional forms are not known.* 

It is clear that solutions will be in some degree arbitrary, since (4) is still satisfied 
when $\beta$ is replaced by any function of $\beta$. For the same reason, any values may be 
given to $\beta$ along contours which are known initially, either as being parts of a 
stress-free boundary, or from considerations of symmetry. But there is nothing arbitrary 
in the shapes of the resulting contours, which alone have practical importance; and 
the problem yields without difficulty to a relaxational treatment, $\partial\beta/\partial x$ and $\partial\beta/\partial y$ 
in (4) being replaced, as usual, by finite-difference approximations. Details will be 
best explained in relation to particular examples. 

* In hooks, for example, the shear stress on a cross-section can be related (along with the 
other four stress-components) with two stress functions $\phi$ and $\psi$ (Southwell 1942, § 26); but 
the stress functions have not so far been determined analytically, and although they can be 
found by Relaxation Methods, these give values of the stress-components only at nodal 
points of a chosen net. 

Example 1 

6. It can be shown (Love 1927, § 231 (f)) that, when Poisson’s ratio $\sigma = \tfrac14$, 

$$X_z = x^2 - a^2, \qquad Y_z = \tfrac12 xy \tag{12}$$

represent (with neglect of a constant factor) the shear stress called into play when 
a uniform shearing action is applied to a beam of which the axis is $Oz$ and 
cross-sections are bounded by the curve 

$$x^2/a^2 + 16y^4/a^4 = 1. \tag{13}$$

This curve is symmetrical with respect both to $Ox$ and to $Oy$, and clearly $Ox$ is one 
of the wanted contours, since at every point on it $Y_z = 0$. Accordingly $\beta$ may be 
given the values zero along $Ox$, 100 along the curved boundary (13). Since $\beta$ must 
also have symmetry about $Oy$, computation may be confined to a single quadrant. 

This problem has been treated with results which are shown in figure 1: fine lines 
give the correct (theoretical) contours, bold lines the contours derived by Relaxation 
Methods on the fairly coarse net shown. The computed values (shown to the left of 
nodal points) are not sufficiently numerous to determine the forms of the stress 
trajectories near A; but here their exact shapes do not matter, since the shear stress 
tends to zero at A.* 
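
That the curved boundary (13) is itself one of the wanted contours can be checked numerically: on it the shear stress with components $X_z = x^2 - a^2$, $Y_z = \tfrac12 xy$ should have no component along the normal. The sketch below is an illustration only; the function names are hypothetical, $a = 4$ is taken as in the worked figures, and the normal is formed from the gradient of the left-hand side of (13).

```python
import math

def shear_components(x, y, a=4.0):
    """Components (X_z, Y_z) as read from (12), constant factor neglected."""
    return x * x - a * a, 0.5 * x * y

def boundary_point(x, a=4.0):
    """Point of curve (13) in the upper half-plane with abscissa x, for |x| <= a."""
    y = 0.5 * a * (1.0 - x * x / (a * a)) ** 0.25
    return x, y

def normal_component(x, a=4.0):
    """X_z n_x + Y_z n_y on the boundary, with (n_x, n_y) an unnormalized normal."""
    x, y = boundary_point(x, a)
    xz, yz = shear_components(x, y, a)
    nx = 2.0 * x / a ** 2          # d/dx of x^2/a^2 + 16 y^4/a^4
    ny = 64.0 * y ** 3 / a ** 4    # d/dy of the same expression
    return xz * nx + yz * ny

for x in (0.0, 1.0, 2.0, 3.0, 3.9):
    print(x, normal_component(x))
```

The printed values vanish to rounding error, in keeping with the statement that shear stress cannot cross an unloaded boundary.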



Figure 1 


7. The finite-difference approximation to (4) is 

$$F_0 = (X_z)_0(\beta_1 - \beta_3) + (Y_z)_0(\beta_2 - \beta_4) = 0, \tag{14}$$

i.e. in this instance, 

$$F_0 = (x^2 - 16)(\beta_1 - \beta_3) + \tfrac12 xy(\beta_2 - \beta_4) = 0; \tag{15}$$

* Because ‘shear stress cannot cross an unloaded boundary’. Cf (e.g.) Southwell 1941 , § 129. 




the suffixes 0, 1, 2, 3, 4 relating to points so denoted in figure 2a. When 0 lies on $Ox$, 
(15) is satisfied identically, and when 0 lies on $Oy$ it is satisfied in virtue of symmetry 
(in both instances because $\beta_1 = \beta_3$). From (15) we deduce the ‘relaxation pattern’ 
shown in figure 2b, which represents the effect on the $F$’s of a unit increment $\Delta\beta_0$ 
made to the value of $\beta$ at an isolated point 0. The use of such ‘patterns’ has been 
explained in previous papers of this series: it is in no way altered (although the 
labour of ‘liquidation’ is increased) by the fact that here a different pattern 
holds for every point. ‘Irregular stars’ can be treated in the manner of Part III 
(Christopherson & Southwell 1938, §§ 23-24). 
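
The residual of (14) and the corresponding relaxation pattern can be sketched on a regular square net as follows. This is an illustration under stated assumptions, not the authors' computing scheme: the function names are hypothetical, nodal values are held in nested lists indexed `[i][j]` with `i` increasing along $Ox$ and `j` along $Oy$, and the suffixes are assumed to follow the usual star convention (1 at $x + h$, 2 at $y + h$, 3 at $x - h$, 4 at $y - h$), which figure 2a is taken to show. Irregular stars near the curved boundary are not treated.

```python
def residual(beta, xz, yz, i, j):
    """F_0 of (14) at the interior node (i, j)."""
    return (xz[i][j] * (beta[i + 1][j] - beta[i - 1][j])
            + yz[i][j] * (beta[i][j + 1] - beta[i][j - 1]))

def relaxation_pattern(xz, yz, i, j):
    """Changes in the residuals at the four neighbours of (i, j) caused by
    a unit increment of beta at (i, j); the residual at (i, j) is unchanged."""
    return {
        (i + 1, j): -xz[i + 1][j],   # there beta_3 is the altered value
        (i - 1, j): +xz[i - 1][j],   # there beta_1 is the altered value
        (i, j + 1): -yz[i][j + 1],   # there beta_4 is the altered value
        (i, j - 1): +yz[i][j - 1],   # there beta_2 is the altered value
    }

# Illustrative net for Example 1 (a = 4): x = 0..4, y = 0..2 in steps of 1 and 0.5.
xs = [float(i) for i in range(5)]
ys = [0.5 * j for j in range(5)]
xz = [[x * x - 16.0 for y in ys] for x in xs]
yz = [[0.5 * x * y for y in ys] for x in xs]
print(relaxation_pattern(xz, yz, 2, 2))
```

Because $\beta_0$ does not enter $F_0$, a change made at one node affects only the residuals of its neighbours, and the coefficients depend on the local values of $X_z$ and $Y_z$; this is why a different pattern holds for every point.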

Figure 2 



In figure 3, where the semi-axes are 4 and 2, so that $a = 4$ in (12) and (13), values 
of $(x^2 - 16)$ and of $\tfrac12 xy$ are recorded at nodal points, and the lengths of all ‘irregular’ 
strings are shown (in square brackets). Trial values for $\beta$ gave calculable ‘residuals’ 
at the internal nodes: these were liquidated by driving them towards the axis $Oy$, 
where $\beta$, being unrestricted, could be altered to liquidate residuals at neighbouring 
(internal) nodes. 





Example 2 

8. Our second example is of the kind discussed in § 4. We seek a graphical 
representation of the solution found in an earlier paper of this series to a problem in 
‘plane strain’, namely, the effect of specified boundary displacements* imposed 
on a rectangle having sides in the proportion 3:1. 

Figure 4 


A diagram in the earlier paper presented its solution in the form of displacement 
values computed for nodal points of a square-mesh net: for this investigation those 
values were recalculated to one more significant figure (figure 4), and from them 
distributions of $(X_x - Y_y)$ and of $X_y$ were derived and recorded in figure 5. (For 
present purposes absolute magnitudes are not material.) It would of course be 
possible to deduce the corresponding stress function $\chi$: the problem here, 
however, is to replace figure 5 by diagrams of the type of (a), (b) and (c), § 4, without 
having recourse to explicit functional expression. 

* As figure 4 shows, $u = 0$ on all four edges, and $v = 0$ on all except the top edge, where it 
has a parabolic distribution tending to induce compression. 







Figure 6 




















Figure 7 . Example 2: trajectories and contours of principal shear stress. 





9. Figure 6 records the nodal values of $\theta_1$, one of the two roots of equation (11): 
the second root $\theta_2 = \theta_1 + 90°$. There is no question of the value appropriate to each 
family of ‘trajectories’, and for each family computation proceeds on the lines of 
§ 7, but from (3) as governing equation instead of (4). No feature calls for special 
notice. 



Figure 9. Example 3: trajectories and contours of principal 
stress difference (= 2 x principal shear stress). 



The two orthogonally intersecting families are exhibited in figure 7, together with 
bold-line contours, deduced from (10), which give the intensity of the principal 
shear. Figure 8 shows the variation of the stress sum $(X_x + Y_y)$ as deduced from 
figure 4. Here too, no novel problem is confronted. 







Example 3 

10. Figures 9 and 10 (corresponding with figures 7 and 8) show the same problem 
solved for a stress system which is partly plastic, partly elastic, so that the contours 
exhibit ‘refraction’ at a plastic-elastic interface. The stress system is that induced 
by tension applied along the horizontal centre-line of a symmetrical specimen 
having two semicircular notches which form a ‘waist’. The computed stress- 
components, being based on a particular hypothesis regarding plastic strain, 
are open to question; but this is of no importance to the present paper, which is 
concerned solely with the graphical representation of a specified stress. 


References 

Christopherson, D. G. & Southwell, R. V. 1938 (‘Part III’) Proc. Roy. Soc. A, 168, 317-350. 
Love, A. E. H. 1927 The mathematical theory of elasticity, 4th ed. Camb. Univ. Press. 
Michell, J. H. 1901 Proc. Lond. Math. Soc. 32, 35-61. 
Southwell, R. V. 1941 Introduction to the theory of elasticity, 2nd ed. Oxford Univ. Press. 
Southwell, R. V. 1942 Proc. Roy. Soc. A, 180, 367-96. 


On the theory of point-particles 

By H. J. Bhabha, F.R.S. and Harish-Chandra 
Cosmic Ray Research Unit, Indian Institute of Science, Bangalore 

(Received 20 March 1944) 


It is deduced from the conservation of the energy-momentum tensor that if the flow of energy 
and momentum into a tube surrounding a time-like world-line, on which the field is singular, 
becomes singular as the size of the tube is contracted to zero, then the singular terms are 
necessarily perfect differentials of quantities on the world-line with respect to the proper time 
along the world-line. The same can be proved of any other tensor, as, for example, the 
angular-momentum tensor, which is conserved. It is proved from this that for any 
point-particle whatever having charge, spin or other properties, which need not be specified, it is 
always possible to deduce exact equations of motion which are finite. 

It is proved further that if the energy-momentum tensor is altered by the addition of 
$\partial K^{\mu\nu\sigma}/\partial x^\sigma$, where $K^{\mu\nu\sigma}$ is any tensor antisymmetric in $\nu$ and $\sigma$, then the equations of motion 
are unaltered, but it is possible to choose $K^{\mu\nu\sigma}$ in such a way as to make the flow of energy 
and momentum into a given tube non-singular. 


By a point-particle is understood a particle whose field-producing and inertial 
properties are all located at a point. The particle may have a finite charge and a 
finite mass, but the charge density and mass density are exactly zero at every point 
of space other than the point at which the particle is located at that instant of time. 
The motion of the particle through space-time is therefore described by a time-like 
world-line. If the particle possesses a dipole or a higher multipole moment, then this 
is described by a suitable co-ordinate having a given value at each point of the 




world-line, but it is again assumed that the dipole density is exactly zero at all 
points of space-time not lying on the world-line. Thus the motion of the particle 
through space-time is completely described by a time-like world-line with the values 
of the co-ordinates describing the spin and other properties of the particle given 
at each point of it. 

It has been shown by Dirac (1938) that an exact theory of a point electron moving 
in an electromagnetic field can be set up free from singularities, and Bhabha (1940) 
and Bhabha & Corben (1941) have shown that a similar theory free from singularities 
can be set up for a point dipole. It has been shown further by one of us (Bhabha 
1939, 1941) that the theory can be extended to point-particles interacting with meson 
fields. These cover all the cases of practical interest, but a general demonstration that 
it is always possible to set up an exact classical theory for point-particles interacting 
with any field has not so far been given. This will be done in the present paper. 

In all the work referred to above, the method used for finding the equations of 
motion of the point-particle is the same. From the field equations we calculate the 
field produced by the point-particle, or more exactly, we take that solution of the 
homogeneous field equations which has a singularity of the required type on the 
world-line. We now surround a finite length of the world-line by a world tube whose 
radius is ultimately made to tend to zero, and calculate the flow of energy, momen¬ 
tum and angular momentum into the world tube from outside by using the usual 
energy-momentum and angular-momentum tensors of the field. For brevity we 
shall refer to the quantities so calculated as the inflow. The equations of motion are 
now found from the condition that conservation of energy, momentum and angular 
momentum requires that the flow of all these quantities into the tube must only 
depend on conditions at the two ends of the tube, that is, it must only be a function 
of the co-ordinates of the particle and their higher derivatives and also possibly of 
the field quantities at the two ends of the tube. Since the field is singular on the 
world-line, the usual energy-momentum tensor is also singular, and in consequence 
the flow of energy and momentum into the tube is likewise singular. That in spite of 
this it has always been possible to derive finite equations for the motion of the point- 
particles has always depended on the circumstance that the singular parts of the 
inflow over an infinitesimal length of the tube are always perfect differentials. We 
shall prove that this is a general property which is a consequence only of the con¬ 
servation of the energy-momentum tensor and therefore that finite equations for 
the motion of a point-particle of any type whatsoever can always be derived. 

We shall also show that it is always possible to modify the energy-momentum 
tensor by the addition of the divergence of a tensor of higher rank in the manner 
suggested by Pryce (1938) so that the inflow over a tube of given shape becomes 
finite. But the inflow would not necessarily be finite over a tube of any other shape. 
That it is always possible to modify the tensor in such a way as to make finite the 
inflow over tubes of any arbitrary shape, as also the energy and momentum integrals 
over arbitrary space-like surfaces is shown by one of us (H.-C.) in the paper which 
immediately follows this. 





1. We use the same notation as in the previous papers by one of us (H. J. B.). 
We take the metric tensor to have the form $g_{00} = -g_{11} = -g_{22} = -g_{33} = 1$ with 
$g_{\mu\nu} = 0$ for $\mu \neq \nu$. The world-line of the particle is described by four co-ordinates 
$z_\mu(\tau)$ which are functions of the proper time $\tau$ measured from some point along it. 
The other co-ordinates of the particle, if any, need not be specified for the present 
work. A dot denotes differentiation with respect to $\tau$, and $v_\mu \equiv \dot z_\mu$ is used to denote 
the velocity of the particle. $x_\mu$ denotes a point of space, $s_\mu = x_\mu - z_\mu(\tau_0)$ the distance 
from any point of space to the retarded point $\tau_0$ on the world-line defined by 

$$s_\mu s^\mu = 0. \tag{1}$$

If $\tau_0$ be kept fixed then equation (1) is also the equation of the light cone whose 
apex is at $\tau_0$. We further introduce a quantity $\kappa$ as a function of the co-ordinates 
$x_\mu$ defined by 

$$\kappa \equiv s_\mu v^\mu(\tau_0). \tag{2}$$


The energy-momentum tensor is denoted by $T^{\mu\nu}$. It satisfies the conservation 
equation 

$$\frac{\partial T^{\mu\nu}}{\partial x^\nu} = 0. \tag{3}$$


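It is necessary to find the generalization of Gauss’s theorem to four-dimensional 
hyperbolic space. Let 

$$\xi^0(x^\mu) = \alpha \tag{4}$$

be a closed three-dimensional surface $S$ surrounding a four-dimensional volume $V$ 
such that the surface $\xi^0(x^\mu) = \alpha - \Delta\alpha$ for positive $\Delta\alpha$ is contained inside (4) and lies 
wholly in the volume $V$. Let $\xi^1$, $\xi^2$ and $\xi^3$ be parameters defining the position of a 
point on the surface $S$. Then, if $X^\nu$ be any tensor, the generalization of Gauss’s 
theorem reads 

$$\iiiint \frac{\partial X^\nu}{\partial x^\nu}\,dx^0\,dx^1\,dx^2\,dx^3 = \iiint X^\nu\,\frac{\partial\xi^0}{\partial x^\nu}\,|D|^{-1}\,d\xi^1\,d\xi^2\,d\xi^3, \tag{5}$$

where $D$ is the determinant of the transformation from the $x$’s to the $\xi$’s and the 
surface is covered in such a way that $d\xi^1$, $d\xi^2$, $d\xi^3$ are always positive: 

$$D = \begin{vmatrix}
\dfrac{\partial\xi^0}{\partial x^0} & \dfrac{\partial\xi^1}{\partial x^0} & \dfrac{\partial\xi^2}{\partial x^0} & \dfrac{\partial\xi^3}{\partial x^0} \\[2mm]
\dfrac{\partial\xi^0}{\partial x^1} & \dfrac{\partial\xi^1}{\partial x^1} & \dfrac{\partial\xi^2}{\partial x^1} & \dfrac{\partial\xi^3}{\partial x^1} \\[2mm]
\dfrac{\partial\xi^0}{\partial x^2} & \dfrac{\partial\xi^1}{\partial x^2} & \dfrac{\partial\xi^2}{\partial x^2} & \dfrac{\partial\xi^3}{\partial x^2} \\[2mm]
\dfrac{\partial\xi^0}{\partial x^3} & \dfrac{\partial\xi^1}{\partial x^3} & \dfrac{\partial\xi^2}{\partial x^3} & \dfrac{\partial\xi^3}{\partial x^3}
\end{vmatrix}. \tag{6}$$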





$\partial\xi^0/\partial x^\mu$ is the normal to the surface $S$. For a displacement $(\partial\xi^0/\partial x_\mu)\,\Delta l$ in the direction 
of this normal with positive $\Delta l$ the change $\Delta\xi^0$ of $\xi^0$ is 

$$\Delta\xi^0 = \frac{\partial\xi^0}{\partial x^\mu}\frac{\partial\xi^0}{\partial x_\mu}\,\Delta l.$$

This is positive if $\partial\xi^0/\partial x^\mu$ is a time-like vector and negative if it is space-like. Thus 
on the space-like portions of the surface the normal to the surface on the right-hand 
side of (5) is directed outwards, and on the time-like portions it is directed inwards. 
By writing $dS_\nu = \dfrac{\partial\xi^0}{\partial x^\nu}\,|D|^{-1}\,d\xi^1\,d\xi^2\,d\xi^3$ for an element of the surface $S$ with its normal 
directed in the sense defined above, the right-hand side of (5) can be written 

$$\int X^\nu\,dS_\nu. \tag{7}$$


Consider a portion of the world-line between the points $\tau_1$ and $\tau_2$ and surround it 
by a world-tube of any arbitrary shape. We consider the ends of the tube to be given 
by the two two-dimensional surfaces where it intersects the future light cones from 
the points $\tau_1$ and $\tau_2$ respectively. The flow of energy and momentum into the tube 
is then given by 

$$\int_{\tau_1}^{\tau_2} T^{\mu\nu}\,dS_\nu, \tag{8}$$

the normal to the surface being directed outwards. For conservation, (8) must be 
put equal to a function of the form $A^\mu(\tau_2) - A^\mu(\tau_1)$, where $A^\mu(\tau)$ is a function of $v^\mu$ 
and the other co-ordinates of the particle, possibly also of the field quantities, and 
their higher derivatives at the point $\tau$ only. On differentiating this equation 

$$\frac{d}{d\tau}\int^{\tau} T^{\mu\nu}\,dS_\nu = \dot A^\mu(\tau). \tag{9}$$

This is not a mathematical identity but furnishes the equations of motion of the 
point-particle. It should be noted at once that the equations derived in this way are 
independent of the shape of the tube. For if the given tube is surrounded by another 




of some other shape, then it follows from (3) and (5) that the difference in the 
integral (8) over the two tubes is just equal to the integral over the portions of the 
light cones at $\tau_1$ and $\tau_2$ intercepted between the two tubes. In the limit when the 
size of both tubes is made to tend to zero, the integrals over the light cones just 
become functions of the conditions at $\tau_1$ and $\tau_2$ respectively, and hence the difference 
in (8) calculated over two tubes of different shape is identically of the form 
$A'^\mu(\tau_2) - A'^\mu(\tau_1)$. This difference can be absorbed into the right-hand side of (9), 
and hence it does not affect the equation of motion which can be so derived. One 
may therefore work with whatever tube is most convenient. 

Consider now a volume $V$ lying between the two light cones $S$ and $S'$ starting 
from the points $\tau'$ and $\tau' + d\tau'$ respectively, and between the two tubes $P$ and $Q$ 
defined by $\kappa = \epsilon$ and $\kappa = \eta$, as shown in figure 1. The boundary of $V$ so defined can 
be characterized by the equation $\xi^0 = 0$, where $\xi^0$ is a discontinuous function chosen 
in the following way: 

$$\xi^0 = \begin{cases}
\tau - (\tau' + d\tau') & \text{on } S', \\
\kappa - \eta & \text{on } Q, \\
\epsilon - \kappa & \text{on } P, \\
\tau' - \tau & \text{on } S.
\end{cases}$$

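Since (3) holds inside this volume $V$ it follows from (5) that 

$$\int_{\tau=\tau'+d\tau'} T^{\mu\nu}\,dS_\nu - \int_{\tau=\tau'} T^{\mu\nu}\,dS_\nu = \int_{\kappa=\epsilon} T^{\mu\nu}\,dS_\nu - \int_{\kappa=\eta} T^{\mu\nu}\,dS_\nu, \tag{10}$$

the normals to the light cones being taken in the direction of increasing $\tau$ in both 
cases, and towards the world-line on the two world-tubes. To evaluate the second 
integral on the left it is convenient to take $\xi^1 = \kappa$, $\xi^2 = s_2$ and $\xi^3 = s_3$ in addition to 
$\xi^0 = \tau$. As shown in the previous papers, it follows from (1) and (2) that 

$$\frac{\partial\tau}{\partial x^\mu} = \frac{s_\mu}{\kappa}, \tag{11}$$

$$\frac{\partial\kappa}{\partial x^\mu} \equiv \kappa_\mu = v_\mu - \frac{s_\mu}{\kappa}(1 - \kappa'), \quad \text{where } \kappa' = \dot v^\nu s_\nu, \tag{12}$$

$$\frac{\partial s_2}{\partial x^\mu} = g_{2\mu} - v_2\,\frac{s_\mu}{\kappa}, \tag{13}$$

$$\frac{\partial s_3}{\partial x^\mu} = g_{3\mu} - v_3\,\frac{s_\mu}{\kappa}. \tag{14}$$

The right-hand sides of (11)-(14) have to be inserted as the columns of the matrix 
on the right of (6). Since the determinant of a matrix vanishes if it has any two 
columns identical, it follows that we may omit all except the first terms on the 
right-hand sides of (11)-(14) and get 

$$D = \frac{s_0 v_1 - s_1 v_0}{\kappa}. \tag{15}$$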





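Therefore 

$$\int_{\tau=\text{const.}} T^{\mu\nu}\,dS_\nu = \iiint_\epsilon^\eta T^{\mu\nu}\,\frac{s_\nu}{\kappa}\left|\frac{\kappa}{s_0 v_1 - s_1 v_0}\right| ds_2\,ds_3\,d\kappa = \int_\epsilon^\eta\!\!\int T^{\mu\nu}\,\frac{s_\nu}{\kappa}\,d\Omega\,d\kappa, \tag{16}$$

$d\Omega$ being the element of surface of a sphere of radius $\kappa$ in the ‘rest system’ in which 
$v_0 = 1$, $v_1 = v_2 = v_3 = 0$. Similarly, 

$$\int_{\kappa=\text{const.}} T^{\mu\nu}\,dS_\nu = d\tau'\int T^{\mu\nu}\kappa_\nu\,d\Omega. \tag{17}$$

(10) now becomes 

$$\frac{d}{d\tau}\int_\epsilon^\eta\!\!\int T^{\mu\nu}\,\frac{s_\nu}{\kappa}\,d\Omega\,d\kappa = \int_{\kappa=\epsilon} T^{\mu\nu}\kappa_\nu\,d\Omega - \int_{\kappa=\eta} T^{\mu\nu}\kappa_\nu\,d\Omega. \tag{18}$$

It is important to note that the left-hand side of (18) is not a perfect differential. 
The integral does not depend in general only on conditions on the world-line at the 
point $\tau$. Differentiating this equation with respect to $\epsilon$, we get 

$$\frac{\partial}{\partial\kappa}\int_\kappa T^{\mu\nu}\kappa_\nu\,d\Omega + \frac{\partial}{\partial\tau}\int_\kappa T^{\mu\nu}\,\frac{s_\nu}{\kappa}\,d\Omega = 0. \tag{19}$$

As $\kappa \to 0$ the second term in (19) becomes a perfect differential, and hence if there are 
any singular terms in the first term of (19) these must be perfect differentials and be 
compensated identically by those in the second term. (19) is the exact statement 
that the singular part of the inflow is a perfect differential. In case the energy 
tensor can be expanded as a series in ascending powers of $\kappa$ for sufficiently small 
values of $\kappa$ starting with some negative power of $\kappa$, then (19) shows that all the terms 
except the term independent of $\kappa$ in the inflow must be perfect differentials.* 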

The singular terms being perfect differentials can always be compensated by the 
addition of suitable terms to $A^\mu$ in (9) and therefore play no part in determining the 
motion of the point-particle. The term independent of $\kappa$ is not a perfect differential, 
and by putting it equal to a suitable perfect differential as in (9) the equation of 
translational motion of the point-particle is obtained. It is seen that to determine 
the equation of a point-particle it is only necessary to calculate that part of the 
inflow which remains finite as $\kappa \to 0$. The rest can be ignored. 

It is clear that the same argument holds if two of the boundaries of the volume $V$ 
are taken as any two surfaces passing through the points r and r + dr instead of the 
light cones. The argument also holds for any other tensor which satisfies a con¬ 
servation law of the type (3), and therefore for the angular-momentum tensor of 
the field. In finding the rotational equations of the particle it is therefore only 
necessary to calculate the finite part of the inflow of angular momentum and to 
put this equal to a suitable perfect differential. 
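
The kinematical relations used in this section lend themselves to a numerical check. The sketch below assumes, purely for illustration, a uniformly accelerated world-line (hyperbolic motion along the $x^1$ axis); that choice, and all function names, are hypothetical and not taken from the paper. It finds the retarded proper time $\tau_0$ from the light-cone condition $s_\mu s^\mu = 0$ by bisection, forms $\kappa = s_\mu v^\mu$, and compares central-difference gradients with the relations $\partial\tau/\partial x^\mu = s_\mu/\kappa$ and $\partial\kappa/\partial x^\mu = v_\mu - (s_\mu/\kappa)(1 - \kappa')$, with $\kappa' = \dot v^\nu s_\nu$. The bracket used for the bisection assumes a field point with $x^0 + x^1 > 0$, for which a retarded point exists on this particular world-line.

```python
import math

METRIC = (1.0, -1.0, -1.0, -1.0)   # g_00 = -g_11 = -g_22 = -g_33 = 1

def dot(p, q):
    """Scalar product of two vectors given by contravariant components."""
    return sum(g * a * b for g, a, b in zip(METRIC, p, q))

def lower(p):
    """Covariant components of a vector given by contravariant components."""
    return [g * a for g, a in zip(METRIC, p)]

def z(tau, acc=1.0):               # assumed world-line: hyperbolic motion
    return (math.sinh(acc * tau) / acc, math.cosh(acc * tau) / acc, 0.0, 0.0)

def v(tau, acc=1.0):               # velocity dz/dtau, with v.v = 1
    return (math.cosh(acc * tau), math.sinh(acc * tau), 0.0, 0.0)

def vdot(tau, acc=1.0):            # acceleration dv/dtau
    return (acc * math.sinh(acc * tau), acc * math.cosh(acc * tau), 0.0, 0.0)

def retarded_tau(x, acc=1.0):
    """Solve s.s = 0 with s^0 > 0 by bisection (assumes x^0 + x^1 > 0)."""
    def f(tau):
        s = [a - b for a, b in zip(x, z(tau, acc))]
        return dot(s, s)
    lo = -20.0 / acc                            # far past: s is time-like, f > 0
    hi = math.asinh(acc * x[0]) / acc           # z^0(hi) = x^0: s is space-like, f <= 0
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if f(mid) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def kappa(x, acc=1.0):
    t0 = retarded_tau(x, acc)
    s = [a - b for a, b in zip(x, z(t0, acc))]
    return dot(s, v(t0, acc))

def gradient(func, x, h=1e-5):
    """Central-difference derivatives of func with respect to x^0 .. x^3."""
    grads = []
    for mu in range(4):
        xp, xm = list(x), list(x)
        xp[mu] += h
        xm[mu] -= h
        grads.append((func(xp) - func(xm)) / (2.0 * h))
    return grads

x = [0.5, 2.0, 0.3, 0.4]           # illustrative field point
t0 = retarded_tau(x)
s = [a - b for a, b in zip(x, z(t0))]
k = dot(s, v(t0))
print(gradient(retarded_tau, x), [c / k for c in lower(s)])
```

The two printed lists agree to the accuracy of the finite differences. An accelerated world-line is used rather than a straight one so that the term in $\kappa'$ is actually exercised when the gradient of $\kappa$ is compared in the same way.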

2. Let $X^{\mu\nu}$ be a tensor antisymmetric in $\mu$ and $\nu$. Let us suppose that the 
co-ordinates $\xi^0$, $\xi^1$, $\xi^2$ and $\xi^3$ of the last section are so chosen that $\xi^1 = \beta$, with $\beta$ constant, 


* This result is implicit in Mathisson’s paper (1940) and can be derived from his variational 
equations, but it is much more cumbersome and indirect to prove it by his method. 





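represents a closed two-dimensional surface $\omega$ forming the boundary of a portion 
$S'$ of the three-dimensional surface $\xi^0 = \alpha$, and assume that $\xi^1 = \beta - \Delta\beta$ ($\Delta\beta$ positive) 
is a two-dimensional surface lying entirely within the three-dimensional surface so 
enclosed. Then it is well known that 

$$\iiint_{S'} \frac{\partial X^{\nu\sigma}}{\partial x^\sigma}\frac{\partial\xi^0}{\partial x^\nu}\,|D|^{-1}\,d\xi^1\,d\xi^2\,d\xi^3 = \iint_\omega X^{\nu\sigma}\,\frac{\partial\xi^0}{\partial x^\nu}\frac{\partial\xi^1}{\partial x^\sigma}\,|D|^{-1}\,d\xi^2\,d\xi^3. \tag{20}$$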


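In this formula let us insert a tensor $K^{\mu\nu\sigma}$ antisymmetric in $\nu$ and $\sigma$ in place of $X^{\nu\sigma}$, 
and take for the surface $\xi^0$ the surface of the tube $\kappa = \epsilon$ between the points $\tau_1$ 
and $\tau_2$. It is convenient in this case to take as the variables $\xi$, $\xi^0 = \kappa$, $\xi^1 = \tau$, $\xi^2 = s_2$, 
$\xi^3 = s_3$, and we obtain, as in the last section, remembering (11)-(15), 

$$\int_{\tau_1}^{\tau_2}\!\!\int \frac{\partial K^{\mu\nu\sigma}}{\partial x^\sigma}\,\kappa_\nu\,d\Omega\,d\tau = \left[\int K^{\mu\nu\sigma}\kappa_\nu\,\frac{s_\sigma}{\kappa}\,d\Omega\right]_{\tau_1}^{\tau_2} = \left[\int K^{\mu\nu\sigma}v_\nu\,\frac{s_\sigma}{\kappa}\,d\Omega\right]_{\tau_1}^{\tau_2} \tag{21}$$

on account of the antisymmetry of $K^{\mu\nu\sigma}$ in $\nu$ and $\sigma$. Differentiating this with respect 
to $\tau_2$, 

$$\int \frac{\partial K^{\mu\nu\sigma}}{\partial x^\sigma}\,\kappa_\nu\,d\Omega = \frac{d}{d\tau}\int K^{\mu\nu\sigma}v_\nu\,\frac{s_\sigma}{\kappa}\,d\Omega. \tag{22}$$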


The inflow of $\partial K^{\mu\nu\sigma}/\partial x^\sigma$ over an infinitesimal portion of a world-tube of any arbitrary
shape can similarly always be shown to be identically a perfect differential. Hence,
if the energy-momentum tensor is replaced by a new one $T'^{\mu\nu}$ defined by

$$T'^{\mu\nu} = T^{\mu\nu} + \frac{\partial K^{\mu\nu\sigma}}{\partial x^\sigma}, \tag{23}$$


which obviously also satisfies a conservation equation like (3), then the inflow is
only altered by the addition of perfect differentials, and these do not alter the
equations of motion since they can be compensated by the addition of suitable terms
to $A^\mu$ in (9).

Denote by that part of the inflow, calculated by using the original tensor 

which is a perfect differential. depends only on the variables at r. Then taking 


1 B* 

J£>cr _- {v v SV-V a & v \ 

4:1TK S 


(24) 


and inserting it into (22), we get, remembering (1), (2), 


j* dK^ vcr 
Jk to* 


K v dQ = — . 2 ^. 


(25) 


Since it has been shown that the singular parts of the inflow are always perfect
differentials they can be included in $B^\mu$, and it has therefore been proved that it is
always possible to find a tensor $K^{\mu\nu\sigma}$ such that the inflow calculated from the
modified tensor (23) is finite over the particular world-tube chosen.




The tensor (24) is by no means uniquely defined. Thus 

1 J3/* 

K flv<r = - &#) (26) 

could equally well have been taken, with $b$ a positive constant and $n$ a number
greater than the highest order of singularity in $B^\mu$. This tensor would in fact have
the advantage over (24) that its integral over any surface at infinity would always
vanish.

The integral of the modified tensor over any arbitrary three-dimensional surface
is not necessarily finite, nor is the inflow finite over a tube of any other shape than
the one chosen above. The general properties of the energy tensor $T^{\mu\nu}$ are investigated
in more detail in the paper by one of us (H.-C.) which follows this, and it is
shown there that it is always possible to find a tensor $K^{\mu\nu\sigma}$ such that the modified
tensor has no singularities of order higher than the third.


References 

Bhabha 1939 Proc. Roy. Soc. A, 172, 384-409.
Bhabha 1940 Proc. Ind. Acad. Sci. A, 11, 247-267, 467.
Bhabha 1941 Proc. Roy. Soc. A, 178, 314-350.
Bhabha & Corben 1941 Proc. Roy. Soc. A, 178, 273-314.
Dirac 1938 Proc. Roy. Soc. A, 167, 148-169.
Mathisson 1940 Proc. Camb. Phil. Soc. 36, 331-350.
Pryce 1938 Proc. Roy. Soc. A, 168, 389-401.





On the removal of the infinite self-energies of point-particles 

By Harish-Chandra, J. H. Bhabha Memorial Student
Cosmic Ray Research Unit, Indian Institute of Science, Bangalore

(Communicated by H. J. Bhabha, F.R.S. Received 20 March 1944)


A general method is set up for modifying the energy-momentum tensor so as to remove 
the singularities in the flow of energy and momentum into the world-line of a particle 
without affecting the equations of motion of the particle. It is shown how the singularities 
of different order may be removed one by one. 

In the case of the electromagnetic and meson fields it is shown that the modified tensor 
leads to a finite integral of energy and momentum over any space-like surface. In other 
cases the corresponding result may be secured by making a further modification in the 
tensor. 


1. It has been shown in the preceding paper* that all singular terms in the
expression for the rate of inflow† of energy and momentum are perfect differentials,
and therefore the equations of motion of a point-particle calculated from considerations
of the conservation of energy and momentum are always finite. One would
now expect to be able to alter, without disturbing the equations of motion, the
energy-momentum tensor of the field so as to make the inflow finite, thus avoiding
completely the appearance of infinities which are entirely spurious, at any rate
from the point of view of the equations of motion. It would be still better if it could
be arranged that the total energy and momentum calculated from the modified
tensor $\bar{T}^{\mu\nu}$ were finite. This of course would automatically secure a finite inflow.

It is proved in A that the replacement of the energy-momentum tensor by

$$\bar{T}^{\mu\nu} = T^{\mu\nu} + \frac{\partial K^{\mu\nu\sigma}}{\partial x^\sigma}, \tag{1}$$


where $K^{\mu\nu\sigma}$ is antisymmetric in $(\nu, \sigma)$, does not alter the equations of motion.
Therefore an attempt should be made to alter $T^{\mu\nu}$ in accordance with (1) in trying
to get rid of the infinities. This method has already been used by Pryce (1938) to
make the energy and momentum of the field finite in the case of the point electron.
It will be shown quite generally in this paper that given an energy-momentum
tensor satisfying the conservation equation

$$\frac{\partial T^{\mu\nu}}{\partial x^\nu} = 0, \tag{2}$$


a tensor $K^{\mu\nu\sigma}$ can always be found so as to make the energy-momentum integrals
calculated from $\bar{T}^{\mu\nu}$ finite provided $T^{\mu\nu}$ fulfils certain very general restrictions. In
general, $K^{\mu\nu\sigma}$ is not symmetric in $\mu$, $\nu$ even when $T^{\mu\nu}$ is so, and therefore the modified
tensor $\bar{T}^{\mu\nu}$ is also not symmetric. But it will be shown that in the case of an
electromagnetic charge or a dipole this symmetry can be achieved. However, the
symmetry of the energy-momentum tensor, which is needed only for the purpose of
constructing an angular-momentum tensor, is not necessary here, because it will
be shown that a similar process can be applied to the angular-momentum tensor
also, so that the total angular momentum calculated from this modified tensor is
finite.

* Bhabha & Harish-Chandra (1944), referred to in this paper as A.
† This term is used in the same sense as in A.

2. Keeping to the notation of A it is now necessary to determine $K^{\mu\nu\sigma}$. Equations
(18) and (19) of A are


( 3 ) 

W 


= J T<"K,dO-j T^dQ, 

4 f T^—dQ = --f- f T^kJQ. 

drJ K k dtcJ K 

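Here $\int_\kappa$ denotes integration over the sphere of intersection of the light cone at $\tau$
with the tube $\kappa$ = constant. This sphere will be called the 'retarded sphere' of
radius $\kappa$ at $\tau$. In the appendix (equation (98)) it is shown that

$$\kappa\frac{\partial}{\partial\kappa}\int_\kappa T^{\mu\nu}\kappa_\nu\,\frac{s^\alpha}{\kappa}\cdots\frac{s^\lambda}{\kappa}\,d\Omega = \int_\kappa \frac{\partial}{\partial x^\sigma}\left(T^{\mu\nu}s^\sigma - T^{\mu\sigma}s^\nu\right)\kappa_\nu\,\frac{s^\alpha}{\kappa}\cdots\frac{s^\lambda}{\kappa}\,d\Omega, \tag{5}$$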

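where $\alpha, \beta, \ldots, \lambda$ are any number of free indices. Now introduce two operators $D$
and $\theta$ defined in the following way:

$$D = \kappa\frac{\partial}{\partial\kappa}, \tag{6a}$$

$$\theta U^{\alpha\beta\ldots\lambda\nu} = \frac{\partial}{\partial x^\sigma}\left(U^{\alpha\beta\ldots\lambda\nu}s^\sigma - U^{\alpha\beta\ldots\lambda\sigma}s^\nu\right) = 2U^{\alpha\beta\ldots\lambda\nu} + s^\sigma\frac{\partial U^{\alpha\beta\ldots\lambda\nu}}{\partial x^\sigma} + U^{\alpha\beta\ldots\lambda\sigma}\frac{s_\sigma}{\kappa}v^\nu - s^\nu\frac{\partial U^{\alpha\beta\ldots\lambda\sigma}}{\partial x^\sigma}, \tag{6b}$$

where $U^{\alpha\beta\ldots\lambda\nu}$ is any arbitrary tensor with any number of indices. It should be
noted that

$$\frac{\partial}{\partial x^\nu}\,\theta U^{\alpha\beta\ldots\lambda\nu} = 0, \tag{7}$$

owing to the antisymmetry in $\nu, \sigma$ of the expression inside the brackets in (6b).
Obviously both $D$ and $\theta$ are linear operators. Also $D\kappa^n = n\kappa^n$ and therefore by
repetition $D^r\kappa^n = n^r\kappa^n$, so that if $f(D)$ is a polynomial expressible in powers of $D$,
$f(D)\kappa^n = f(n)\kappa^n$. This holds whether $n$ is positive or negative.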

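Using $D$ and $\theta$, (5) can be written in the form

$$D\int_\kappa T^{\mu\nu}\kappa_\nu\,\frac{s^\alpha}{\kappa}\cdots\frac{s^\lambda}{\kappa}\,d\Omega = \int_\kappa \theta T^{\mu\nu}\cdot\kappa_\nu\,\frac{s^\alpha}{\kappa}\cdots\frac{s^\lambda}{\kappa}\,d\Omega.$$

Since $\theta T^{\mu\nu}$ is conserved with respect to $\nu$ as $T^{\mu\nu}$ was, the same considerations as
above can be applied to $\theta T^{\mu\nu}$ instead of $T^{\mu\nu}$, and hence

$$D^2\int_\kappa T^{\mu\nu}\kappa_\nu\,\frac{s^\alpha}{\kappa}\cdots\frac{s^\lambda}{\kappa}\,d\Omega = D\int_\kappa (\theta T^{\mu\nu})\,\kappa_\nu\,\frac{s^\alpha}{\kappa}\cdots\frac{s^\lambda}{\kappa}\,d\Omega = \int_\kappa \theta^2 T^{\mu\nu}\cdot\kappa_\nu\,\frac{s^\alpha}{\kappa}\cdots\frac{s^\lambda}{\kappa}\,d\Omega.$$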

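By repetition it can be proved that

$$D^r\int_\kappa T^{\mu\nu}\kappa_\nu\,\frac{s^\alpha}{\kappa}\cdots\frac{s^\lambda}{\kappa}\,d\Omega = \int_\kappa \theta^r T^{\mu\nu}\cdot\kappa_\nu\,\frac{s^\alpha}{\kappa}\cdots\frac{s^\lambda}{\kappa}\,d\Omega,$$

and therefore

$$f(D)\int_\kappa T^{\mu\nu}\kappa_\nu\,\frac{s^\alpha}{\kappa}\cdots\frac{s^\lambda}{\kappa}\,d\Omega = \int_\kappa f(\theta)T^{\mu\nu}\cdot\kappa_\nu\,\frac{s^\alpha}{\kappa}\cdots\frac{s^\lambda}{\kappa}\,d\Omega, \tag{8}$$

where $f(D)$ is a polynomial expressible in positive integral powers of $D$.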

It will now be assumed that the part of $T^{\mu\nu}$ which becomes infinite as $\kappa \to 0$ can
be expressed for sufficiently small values of $\kappa$ in the form of a series involving a finite
number of negative powers of $\kappa$, so that for sufficiently small $\kappa$ one can write*

$$T^{\mu\nu} = T_0^{\mu\nu} + \sum_{r=1}^{m+2}\frac{a^{\mu\nu}_{(-r)}}{\kappa^r}, \tag{9}$$

where $T_0^{\mu\nu}$ remains finite as $\kappa \to 0$ and $a^{\mu\nu}_{(-r)}$ are functions of $s^\alpha/\kappa$ and $\tau$ only. It is
assumed that these functions are finite and continuous. That the energy-momentum
tensors of the electromagnetic and the meson fields actually satisfy these requirements
is shown in the appendix. Therefore

$$\int_\kappa T^{\mu\nu}\kappa_\nu\,\frac{s^\alpha}{\kappa}\cdots\frac{s^\lambda}{\kappa}\,d\Omega = c_0^{\mu\alpha\ldots\lambda} + \sum_{r=1}^{m}\frac{c^{\mu\alpha\ldots\lambda}_{(-r)}}{\kappa^r} \tag{10}$$

can be written, where $c_0^{\mu\alpha\ldots\lambda}$ is a function of $\kappa$ and $\tau$ which remains finite as $\kappa \to 0$ and
$c^{\mu\alpha\ldots\lambda}_{(-r)}$ are functions of $\tau$ only. It is assumed that for sufficiently small $\kappa$, $c_0^{\mu\alpha\ldots\lambda}$ is
expressible as a series in positive powers of $\kappa$. This also is true for the electromagnetic
and meson fields.

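Applying the operator $f(D)$ to both the sides of (10),

$$f(D)\int_\kappa T^{\mu\nu}\kappa_\nu\,\frac{s^\alpha}{\kappa}\cdots\frac{s^\lambda}{\kappa}\,d\Omega = f(D)\,c_0^{\mu\alpha\ldots\lambda} + \sum_{r=1}^{m} f(-r)\,\frac{c^{\mu\alpha\ldots\lambda}_{(-r)}}{\kappa^r}. \tag{11}$$

Taking

$$f(D) = \prod_{k=1}^{m}\left(1 + \frac{D}{k}\right), \tag{12}$$

we get $f(-r) = 0$ for $r = 1, 2, \ldots, m$. From (11) and (8) it follows that

$$\int_\kappa \prod_{k=1}^{m}\left(1 + \frac{\theta}{k}\right)T^{\mu\nu}\cdot\kappa_\nu\,\frac{s^\alpha}{\kappa}\cdots\frac{s^\lambda}{\kappa}\,d\Omega = \prod_{k=1}^{m}\left(1 + \frac{D}{k}\right)c_0^{\mu\alpha\ldots\lambda}. \tag{13}$$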
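The cancellation produced by the choice (12) can be checked with exact arithmetic. The following is a minimal sketch, not part of the original paper: the helper names `f` and `apply_f_of_D` and the sample coefficients are hypothetical, chosen only for illustration. It applies $f(D) = \prod_{k=1}^{m}(1 + D/k)$ to a finite series in powers of $\kappa$ by means of the rule $f(D)\kappa^n = f(n)\kappa^n$ stated above.

```python
from fractions import Fraction


def f(n, m):
    """Value f(n) of the polynomial f(D) = prod_{k=1}^{m} (1 + D/k) at D = n."""
    value = Fraction(1)
    for k in range(1, m + 1):
        value *= 1 + Fraction(n, k)
    return value


def apply_f_of_D(series, m):
    """Apply f(D) to a series given as {power of kappa: coefficient}.

    Uses D kappa^n = n kappa^n, hence f(D) kappa^n = f(n) kappa^n.
    Terms whose coefficient becomes zero are dropped.
    """
    result = {}
    for power, coefficient in series.items():
        new_coefficient = f(power, m) * coefficient
        if new_coefficient != 0:
            result[power] = new_coefficient
    return result


# Hypothetical sample series: a regular part (powers 0 and 1)
# together with singular powers kappa^-1 ... kappa^-4.
sample = {1: 1, 0: 3, -1: 2, -2: 7, -3: 5, -4: 1}
cleaned = apply_f_of_D(sample, m=3)
print(cleaned)
```

With $m = 3$ the powers $\kappa^{-1}$, $\kappa^{-2}$ and $\kappa^{-3}$ are annihilated, while a power such as $\kappa^{-4}$, which lies outside the range assumed in (10), would survive. This is why $m$ has to be matched to the highest singularity present.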


* The highest singularity is written as of the order $m+2$ for convenience. The removal of
only those singularities from $T^{\mu\nu}$ is necessary which are of an order higher than the 2nd, since
singularities of the 2nd or lesser order give a finite energy integral (Pryce 1938).



Since the right side of this equation is non-singular, the left must be so, i.e.
$\prod_{k=1}^{m}\left(1 + \frac{\theta}{k}\right)T^{\mu\nu}\cdot\kappa_\nu$ gives a finite integral. From this it follows that

$$\prod_{k=1}^{m}\left(1 + \frac{\theta}{k}\right)T^{\mu\nu}\cdot\kappa_\nu \tag{14}$$

cannot contain any singularity of an order higher than the 2nd, because if it contained
a term of the type $b^\mu/\kappa^{r+2}$ (where $r > 0$), this would give after integration a singular
term of the type

$$\frac{1}{\kappa^r}\int b^\mu\,\frac{s^\alpha}{\kappa}\cdots\frac{s^\lambda}{\kappa}\,d\omega, \qquad \text{where } d\omega = \frac{d\Omega}{\kappa^2} = \text{the element of solid angle}.$$

Since such a term is absent from the right side of (13) it follows that

$$\int b^\mu\,\frac{s^\alpha}{\kappa}\cdots\frac{s^\lambda}{\kappa}\,d\omega = 0. \tag{15}$$

As in the rest system $s^\alpha/\kappa$ are nothing else than the direction cosines (except $s^0/\kappa$
which is 1), it follows from the expansibility of every continuous function defined
on the surface of a sphere, in terms of surface harmonics, that $b^\mu = 0$. Thus (14) does
not contain any singularities higher than of the 2nd order.

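Because

$$\prod_{k=1}^{m}\left(1 + \frac{\theta}{k}\right) = 1 + a\theta + b\theta^2 + \ldots = 1 + \theta(a + b\theta + \ldots),$$

where $a$, $b$, ... are numerical coefficients, it is clear that

$$\prod_{k=1}^{m}\left(1 + \frac{\theta}{k}\right)T^{\mu\nu} = T^{\mu\nu} + \theta(a + b\theta + \ldots)T^{\mu\nu} = T^{\mu\nu} + \theta K^{\mu\nu},$$

where $K^{\mu\nu} = (a + b\theta + \ldots)T^{\mu\nu}$.

Further, as $\theta K^{\mu\nu} = \partial K^{\mu\nu\sigma}/\partial x^\sigma$, where $K^{\mu\nu\sigma} = K^{\mu\nu}s^\sigma - K^{\mu\sigma}s^\nu$, the modified tensor

$$\prod_{k=1}^{m}\left(1 + \frac{\theta}{k}\right)T^{\mu\nu} = T^{\mu\nu} + \frac{\partial K^{\mu\nu\sigma}}{\partial x^\sigma},$$

which is just of the required form. Put

$$T'^{\mu\nu} = \prod_{k=1}^{m}\left(1 + \frac{\theta}{k}\right)T^{\mu\nu}. \tag{16}$$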

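It should be noted in passing that it can be shown directly that $T'^{\mu\nu}$ does not
contain singularities of an order higher than the 3rd, because, as shown in the
appendix (equation (102)),

$$D\int_\kappa T^{\mu\nu}(\delta^\sigma_\nu - v^\sigma v_\nu)\,\frac{s^\alpha}{\kappa}\cdots\frac{s^\lambda}{\kappa}\,d\Omega = \int_\kappa \theta T^{\mu\nu}\cdot(\delta^\sigma_\nu - v^\sigma v_\nu)\,\frac{s^\alpha}{\kappa}\cdots\frac{s^\lambda}{\kappa}\,d\Omega.$$

Hence

$$\int_\kappa T'^{\mu\nu}(\delta^\sigma_\nu - v^\sigma v_\nu)\,\frac{s^\alpha}{\kappa}\cdots\frac{s^\lambda}{\kappa}\,d\Omega = \prod_{k=1}^{m}\left(1 + \frac{D}{k}\right)\int_\kappa T^{\mu\nu}(\delta^\sigma_\nu - v^\sigma v_\nu)\,\frac{s^\alpha}{\kappa}\cdots\frac{s^\lambda}{\kappa}\,d\Omega. \tag{17}$$

Through the same reasoning as used before it can be shown that, since the right side
of (17) is non-singular, so also must be the left. From this it can be inferred exactly
as in the case of $T'^{\mu\nu}\kappa_\nu$ that $T'^{\mu\nu}(\delta^\sigma_\nu - v^\sigma v_\nu)$ does not contain singularities of an order
higher than the 2nd.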

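It is further found (appendix equation (100)) that

$$D\int_\kappa T^{\mu\nu}s_\nu\,\frac{s^\alpha}{\kappa}\cdots\frac{s^\lambda}{\kappa}\,d\Omega = \int_\kappa \theta T^{\mu\nu}\cdot s_\nu\,\frac{s^\alpha}{\kappa}\cdots\frac{s^\lambda}{\kappa}\,d\Omega.$$

Hence

$$\int_\kappa T'^{\mu\nu}s_\nu\,\frac{s^\alpha}{\kappa}\cdots\frac{s^\lambda}{\kappa}\,d\Omega = \prod_{k=1}^{m}\left(1 + \frac{D}{k}\right)\int_\kappa T^{\mu\nu}s_\nu\,\frac{s^\alpha}{\kappa}\cdots\frac{s^\lambda}{\kappa}\,d\Omega. \tag{18}$$

From this one infers precisely as before that $T'^{\mu\nu}s_\nu$ does not contain singularities
higher than of the 2nd order, so that $T'^{\mu\nu}\frac{s_\nu}{\kappa}$ or $T'^{\mu\nu}\frac{s_\nu}{\kappa}(1 - \kappa')$ cannot contain
singularities higher than $1/\kappa^3$. Since it has also been proved that

$$T'^{\mu\nu}\kappa_\nu = T'^{\mu\nu}\left\{v_\nu - \frac{s_\nu}{\kappa}(1 - \kappa')\right\}$$

does not have singularities higher than of the 2nd order, it follows that $T'^{\mu\nu}v_\nu$ does
not possess terms of order higher than $1/\kappa^3$. Now write $T'^{\mu\nu}$ in the form

$$T'^{\mu\nu} = T'^{\mu\sigma}(\delta^\nu_\sigma - v_\sigma v^\nu) + (T'^{\mu\sigma}v_\sigma)v^\nu.$$

The first term does not contain terms of order higher than $1/\kappa^2$ and the second of
order higher than $1/\kappa^3$. So the highest singularity in $T'^{\mu\nu}$ can at most be of the
order $1/\kappa^3$.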

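3. We now proceed to examine in detail the process of removal of the singular
terms of $T^{\mu\nu}$ by the operator $\prod_{k=1}^{m}\left(1 + \frac{\theta}{k}\right)$. In addition to $\theta$ we define another operator $\phi$
by the equation

$$\phi U^{\alpha\ldots\nu} = 2U^{\alpha\ldots\nu} + s^\sigma\frac{\partial U^{\alpha\ldots\nu}}{\partial x^\sigma} + U^{\alpha\ldots\sigma}\frac{s_\sigma}{\kappa}v^\nu, \tag{19}$$

so that from (6b) and (19) is obtained

$$\theta U^{\alpha\ldots\nu} = \phi U^{\alpha\ldots\nu} - s^\nu\frac{\partial U^{\alpha\ldots\sigma}}{\partial x^\sigma}. \tag{20}$$

When $U^{\alpha\ldots\sigma}$ is conserved with respect to $\sigma$,

$$\theta U^{\alpha\ldots\nu} = \phi U^{\alpha\ldots\nu}. \tag{21}$$

While $\theta U^{\alpha\ldots\nu}$ is always conserved with respect to $\nu$, $\phi U^{\alpha\ldots\nu}$ is conserved in general
only when $U^{\alpha\ldots\nu}$ itself is conserved.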
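The step from the definition (6b) of $\theta$ to the relation (20) can be written out explicitly. The following worked derivation is a sketch under an assumption taken from the notation of A as it appears to be used here: $s^\mu = x^\mu - z^\mu(\tau)$ with $\tau$ the retarded proper time, so that $\partial\tau/\partial x^\sigma = s_\sigma/\kappa$ and $v^\sigma s_\sigma = \kappa$.

```latex
\begin{aligned}
\frac{\partial s^\nu}{\partial x^\sigma}
  &= \delta^\nu_\sigma - v^\nu\frac{s_\sigma}{\kappa},
\qquad
\frac{\partial s^\sigma}{\partial x^\sigma} = 4 - \frac{v^\sigma s_\sigma}{\kappa} = 3, \\[4pt]
\theta U^{\alpha\ldots\nu}
  &= \frac{\partial}{\partial x^\sigma}
     \left(U^{\alpha\ldots\nu}s^\sigma - U^{\alpha\ldots\sigma}s^\nu\right) \\
  &= 3U^{\alpha\ldots\nu}
     + s^\sigma\frac{\partial U^{\alpha\ldots\nu}}{\partial x^\sigma}
     - U^{\alpha\ldots\sigma}\left(\delta^\nu_\sigma - v^\nu\frac{s_\sigma}{\kappa}\right)
     - s^\nu\frac{\partial U^{\alpha\ldots\sigma}}{\partial x^\sigma} \\
  &= \underbrace{2U^{\alpha\ldots\nu}
     + s^\sigma\frac{\partial U^{\alpha\ldots\nu}}{\partial x^\sigma}
     + U^{\alpha\ldots\sigma}\frac{s_\sigma}{\kappa}v^\nu}_{\phi U^{\alpha\ldots\nu}}
     \;-\; s^\nu\frac{\partial U^{\alpha\ldots\sigma}}{\partial x^\sigma}.
\end{aligned}
```

The last term vanishes when $U^{\alpha\ldots\sigma}$ is conserved with respect to $\sigma$, which is the content of (21).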

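Let $T^{\mu\nu}_{(-n)}$ denote the $n$th order term in $T^{\mu\nu}$ so that in accordance with (9)
$T^{\mu\nu}_{(-n)} = a^{\mu\nu}_{(-n)}/\kappa^n$ and

$$\phi T^{\mu\nu}_{(-n)} = 2T^{\mu\nu}_{(-n)} + s^\sigma\frac{\partial T^{\mu\nu}_{(-n)}}{\partial x^\sigma} + T^{\mu\sigma}_{(-n)}\frac{s_\sigma}{\kappa}v^\nu.$$

As before assume $a^{\mu\nu}_{(-n)}$ to be a function of the arguments $s^\alpha/\kappa$ and $\tau$ only. Now both
$\tau$ and $s^\alpha/\kappa$ are to be treated as constants with respect to the operator $s^\sigma\frac{\partial}{\partial x^\sigma}$, because
$s^\sigma\frac{\partial\tau}{\partial x^\sigma} = 0$ and $s^\sigma\frac{\partial}{\partial x^\sigma}\left(\frac{s^\alpha}{\kappa}\right) = 0$, so that $s^\sigma\frac{\partial a^{\mu\nu}_{(-n)}}{\partial x^\sigma} = 0$. Therefore

$$s^\sigma\frac{\partial}{\partial x^\sigma}\left(\frac{a^{\mu\nu}_{(-n)}}{\kappa^n}\right) = -n\,\frac{a^{\mu\nu}_{(-n)}}{\kappa^{n+1}}\,s^\sigma\kappa_\sigma = -n\,\frac{a^{\mu\nu}_{(-n)}}{\kappa^n} = -n\,T^{\mu\nu}_{(-n)}. \tag{22}$$

Thus

$$\phi T^{\mu\nu}_{(-n)} = (2 - n)\,T^{\mu\nu}_{(-n)} + T^{\mu\sigma}_{(-n)}\frac{s_\sigma}{\kappa}v^\nu = \left[(2 - n)\,\delta^\nu_\sigma + v^\nu\frac{s_\sigma}{\kappa}\right]T^{\mu\sigma}_{(-n)}. \tag{23}$$

Now since $s^\rho\frac{\partial}{\partial x^\rho}\left(\frac{s_\sigma}{\kappa}\right) = 0$, it follows that

$$\phi^2 T^{\mu\nu}_{(-n)} = \left[(2 - n)\,\delta^\nu_\rho + v^\nu\frac{s_\rho}{\kappa}\right]\left[(2 - n)\,\delta^\rho_\sigma + v^\rho\frac{s_\sigma}{\kappa}\right]T^{\mu\sigma}_{(-n)}.$$

By repetition

$$\phi^l T^{\mu\nu}_{(-n)} = \left[(2 - n)\,\delta^\nu_\sigma + v^\nu\frac{s_\sigma}{\kappa}\right]^l T^{\mu\sigma}_{(-n)} \tag{24}$$

for any positive integer $l$, where $[\ldots]^l$ denotes the product of $l$ such factors, each contracted with the next.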

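Put

$$a\,\delta^\nu_\sigma + b\,v^\nu\frac{s_\sigma}{\kappa} = A(a, b)^\nu_\sigma \tag{25a}$$

and

$$\left[a\,\delta^\nu_\sigma + b\,v^\nu\frac{s_\sigma}{\kappa}\right]^l = A^l(a, b)^\nu_\sigma. \tag{25b}$$

The indices will sometimes be omitted. For example, we shall write

$$\phi^l T_{(-n)} = A^l(2 - n, 1)\,T_{(-n)}, \tag{26a}$$

meaning thereby

$$\phi^l T^{\mu\nu}_{(-n)} = \left[A^l(2 - n, 1)\right]^\nu_\sigma T^{\mu\sigma}_{(-n)}. \tag{26b}$$

It is also seen that

$$cA(a, b) = A(ca, cb), \tag{27a}$$

$$A(a_1, b_1) + A(a_2, b_2) = A(a_1 + a_2, b_1 + b_2), \tag{27b}$$

and

$$c = A(c, 0), \tag{27c}$$

where $c$ is an ordinary number. On using (26) and (27)

$$\prod_{k=1}^{m}\left(1 + \frac{\phi}{k}\right)T_{(-n)} = \prod_{k=1}^{m}\left[1 + \frac{A(2 - n, 1)}{k}\right]T_{(-n)} = \prod_{k=1}^{m}A\left(1 + \frac{2 - n}{k}, \frac{1}{k}\right)T_{(-n)}. \tag{28}$$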



Consider now $A(a_1, b_1)\,A(a_2, b_2)$. It is found that

$$\left(a_1\delta^\nu_\rho + b_1 v^\nu\frac{s_\rho}{\kappa}\right)\left(a_2\delta^\rho_\sigma + b_2 v^\rho\frac{s_\sigma}{\kappa}\right) = a_1a_2\,\delta^\nu_\sigma + \left[(a_1 + b_1)(a_2 + b_2) - a_1a_2\right]v^\nu\frac{s_\sigma}{\kappa} = A\left(a_1a_2,\ [a_1 + b_1][a_2 + b_2] - a_1a_2\right)^\nu_\sigma.$$

It can easily be proved by induction that

$$A(a_1, b_1)\,A(a_2, b_2)\cdots A(a_n, b_n) = A\left(a_1a_2\cdots a_n,\ [a_1 + b_1][a_2 + b_2]\cdots[a_n + b_n] - a_1a_2\cdots a_n\right). \tag{29}$$
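The product rule (29) rests only on the fact that $v^\nu s_\sigma/\kappa$ is idempotent, since $s_\rho v^\rho = \kappa$. A small exact check is sketched below. It is an illustration and not part of the paper: the metric signature $(+,-,-,-)$, the particular vectors `v` and `s`, and the helper names are all assumptions chosen so that $v$ is a unit time-like vector and $s$ is a null vector.

```python
from fractions import Fraction

METRIC = (1, -1, -1, -1)  # assumed signature (+, -, -, -)


def dot(x, y):
    """Lorentz scalar product x_mu y^mu."""
    return sum(g * a * b for g, a, b in zip(METRIC, x, y))


def A(a, b, v, s):
    """Matrix A(a, b)^nu_sigma = a delta^nu_sigma + b v^nu s_sigma / kappa."""
    kappa = dot(s, v)
    s_lower = [g * c for g, c in zip(METRIC, s)]
    return [
        [Fraction(a) * (1 if nu == sigma else 0)
         + Fraction(b) * v[nu] * s_lower[sigma] / kappa
         for sigma in range(4)]
        for nu in range(4)
    ]


def matmul(x, y):
    """Contract the lower index of x with the upper index of y."""
    return [[sum(x[i][k] * y[k][j] for k in range(4)) for j in range(4)]
            for i in range(4)]


def removal_operator(n, m, v, s):
    """Product over k = 1..m of A(1 + (2 - n)/k, 1/k), the form that
    prod (1 + phi/k) takes on a term of order n according to (28)."""
    result = A(1, 0, v, s)
    for k in range(1, m + 1):
        result = matmul(result, A(1 + Fraction(2 - n, k), Fraction(1, k), v, s))
    return result


# Hypothetical sample vectors: v.v = 1 (unit time-like), s.s = 0 (null).
v = [Fraction(5, 4), Fraction(3, 4), Fraction(0), Fraction(0)]
s = [Fraction(5), Fraction(3), Fraction(4), Fraction(0)]

lhs = matmul(A(2, 3, v, s), A(-1, 5, v, s))
rhs = A(2 * -1, (2 + 3) * (-1 + 5) - 2 * -1, v, s)
print(lhs == rhs)
```

The same helper shows the effect of the operator on individual orders: `removal_operator(4, 2, v, s)` is the zero matrix, while `removal_operator(3, 3, v, s)` equals `A(0, 1, v, s)`, so a 3rd order term is not removed but is projected onto $v^\nu s_\sigma/\kappa$.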

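Thus we get immediately

$$\prod_{k=1}^{m}\left(1 + \frac{\phi}{k}\right)T_{(-n)} = A\left(\prod_{k=1}^{m}\left[1 + \frac{2 - n}{k}\right],\ \prod_{k=1}^{m}\left[1 + \frac{3 - n}{k}\right] - \prod_{k=1}^{m}\left[1 + \frac{2 - n}{k}\right]\right)T_{(-n)}. \tag{30}$$

Hence if $m \ge n - 2$ and $n \ge 4$,

$$\prod_{k=1}^{m}\left(1 + \frac{2 - n}{k}\right) = \prod_{k=1}^{m}\left(1 + \frac{3 - n}{k}\right) = 0.$$

Thus

$$\prod_{k=1}^{m}\left(1 + \frac{\phi}{k}\right)T_{(-n)} = 0 \quad \text{for } m + 2 \ge n \ge 4.$$

Since $m + 2$ has already been chosen equal to the order of the highest singularity in
$T^{\mu\nu}$, it follows that the operator $\prod_{k=1}^{m}\left(1 + \frac{\phi}{k}\right)$ when applied to $T^{\mu\nu}$ removes all
singular terms higher than the 3rd. For the term of the 3rd order, from (28)
and (30),

$$\prod_{k=1}^{m}\left(1 + \frac{\phi}{k}\right)T^{\mu\nu}_{(-3)} = A(0, 1)\,T_{(-3)} = T^{\mu\sigma}_{(-3)}\frac{s_\sigma}{\kappa}v^\nu.$$

Thus

$$\prod_{k=1}^{m}\left(1 + \frac{\phi}{k}\right)T^{\mu\nu} = \prod_{k=1}^{m}\left(1 + \frac{\phi}{k}\right)T^{\mu\nu}_{-2)} + T^{\mu\sigma}_{(-3)}\frac{s_\sigma}{\kappa}v^\nu,$$

where $T^{\mu\nu}_{-2)}$ denotes the part of $T^{\mu\nu}$ remaining after omitting terms of 3rd order or
higher.*

* $T^{\mu\nu}_{-n)}$ denotes the part of $T^{\mu\nu}$ left over after omitting terms of order higher than the $n$th,
and $T^{\mu\nu}_{(-n}$ the part obtained by retaining only terms of the $n$th or higher order, so that

$$T^{\mu\nu}_{-n+1)} + T^{\mu\nu}_{(-n} = T^{\mu\nu}.$$

The suffixes $-n)$ and $(-n$ attached to any other quantity have analogous significance.
Similarly the suffix $(-n)$ always denotes the term of the $n$th order.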

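Since $T^{\mu\nu}$ is conserved with respect to $\nu$, $\phi T^{\mu\nu} = \theta T^{\mu\nu}$ and $\phi^l T^{\mu\nu} = \theta^l T^{\mu\nu}$, so that

$$T'^{\mu\nu} = \prod_{k=1}^{m}\left(1 + \frac{\theta}{k}\right)T^{\mu\nu} = \prod_{k=1}^{m}\left(1 + \frac{\phi}{k}\right)T^{\mu\nu} = \prod_{k=1}^{m}\left(1 + \frac{\phi}{k}\right)T^{\mu\nu}_{-2)} + T^{\mu\sigma}_{(-3)}\frac{s_\sigma}{\kappa}v^\nu, \tag{31}$$

which is in conformity with the proved fact that $T'^{\mu\nu}$ does not contain singularities
higher than of the 3rd order.

In accordance with (15)

$$\prod_{k=1}^{m}\left(1 + \frac{\phi}{k}\right) = 1 + \phi(a + b\phi + \ldots),$$

so that

$$T'^{\mu\nu} = T^{\mu\nu}_{-2)} + \phi(a + b\phi + \ldots)T^{\mu\nu}_{-2)} + T^{\mu\sigma}_{(-3)}\frac{s_\sigma}{\kappa}v^\nu$$

$$= T^{\mu\nu}_{-2)} + \theta(a + b\phi + \ldots)T^{\mu\nu}_{-2)} + T^{\mu\sigma}_{(-3)}\frac{s_\sigma}{\kappa}v^\nu + s^\nu\frac{\partial}{\partial x^\sigma}\left[(a + b\phi + \ldots)T^{\mu\sigma}_{-2)}\right]. \tag{32}$$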

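A new tensor $T''^{\mu\nu}$ is now introduced, defined by

$$T''^{\mu\nu} = T'^{\mu\nu} - \theta(a + b\phi + \ldots)T^{\mu\nu}_{-2)} = T^{\mu\nu}_{-2)} + T^{\mu\sigma}_{(-3)}\frac{s_\sigma}{\kappa}v^\nu + s^\nu\frac{\partial}{\partial x^\sigma}\left[(a + b\phi + \ldots)T^{\mu\sigma}_{-2)}\right]. \tag{33}$$

$T''^{\mu\nu}$ is obviously conserved and is of the form $T^{\mu\nu} + \partial K^{\mu\nu\sigma}/\partial x^\sigma$, where $K^{\mu\nu\sigma}$ is
antisymmetrical in $\nu$, $\sigma$, and therefore $T''^{\mu\nu}$ instead of $T'^{\mu\nu}$ may be taken as the modified
tensor.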

It will now be shown that 


A[ ( »+^+...) rej = «[^] ( . 


■3) 


dT^ 

dx* 


(34) 


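It has been seen above that the order of a term is not changed by application of the
operator $\phi$. Hence $(a + b\phi + \ldots)T^{\mu\nu}_{-2)}$ contains no singularities higher than of the 2nd
order. Since $T^{\mu\nu}_{(-3} = T^{\mu\nu} - T^{\mu\nu}_{-2)}$,
it follows from the conservation of $T^{\mu\nu}$ and therefore of $(a + b\phi + \ldots)T^{\mu\nu}$ that

$$\frac{\partial}{\partial x^\sigma}\left[(a + b\phi + \ldots)T^{\mu\sigma}_{-2)}\right] = -\frac{\partial}{\partial x^\sigma}\left[(a + b\phi + \ldots)T^{\mu\sigma}_{(-3}\right]. \tag{35}$$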


Now 

37^ 

dx* 


3^») 


(36) 


= — j -— * and 
3r 


>/> = 




where 


(37) 



treating $\tau$, $s^\alpha/\kappa$ as independent variables. It follows therefore that on differentiation
$T^{\mu\nu}_{(-n)}$ can give terms only of the $n$th and $(n+1)$th order. The same result is obviously
applicable to terms of different orders in $(a + b\phi + \ldots)T^{\mu\nu}$. Thus the highest
singularity in $\frac{\partial}{\partial x^\sigma}\left[(a + b\phi + \ldots)T^{\mu\sigma}_{-2)}\right]$ is of the 3rd order, which is also the order of the
lowest singularity in $\frac{\partial}{\partial x^\sigma}\left[(a + b\phi + \ldots)T^{\mu\sigma}_{(-3}\right]$. From equation (35) it follows therefore
that $\frac{\partial}{\partial x^\sigma}\left[(a + b\phi + \ldots)T^{\mu\sigma}_{-2)}\right]$ contains only one term of the 3rd order, because the
terms of lower order which it could contain do not exist on the right side of (35).
Since only the 2nd order term in $(a + b\phi + \ldots)T^{\mu\nu}_{-2)}$ can contribute to a 3rd order term
in the divergence, it is sufficient to calculate the 2nd order term of $(a + b\phi + \ldots)T^{\mu\nu}_{-2)}$
and find out the 3rd order term that results from it on differentiation.

Now proceed to determine $(a + b\phi + \ldots)T^{\mu\nu}_{(-n)}$. From (26), (27) and (29) it follows
immediately that

$$(a + b\phi + \ldots)T_{(-n)} = A(p, q)\,T_{(-n)}, \tag{38}$$


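where $p$, $q$ are some numerical coefficients. However, on using (27)-(30) one gets

$$\begin{aligned}
\prod_{k=1}^{m}\left(1 + \frac{\phi}{k}\right)T_{(-n)} &= \left[1 + \phi(a + b\phi + \ldots)\right]T_{(-n)} = T_{(-n)} + \phi A(p, q)\,T_{(-n)} \\
&= T_{(-n)} + A(2 - n, 1)\,A(p, q)\,T_{(-n)} \\
&= A\left(1 + p(2 - n),\ q(3 - n) + p\right)T_{(-n)} \\
&= A\left[\prod_{k=1}^{m}\left(1 + \frac{2 - n}{k}\right),\ \prod_{k=1}^{m}\left(1 + \frac{3 - n}{k}\right) - \prod_{k=1}^{m}\left(1 + \frac{2 - n}{k}\right)\right]T_{(-n)}.
\end{aligned}$$

Thus

$$p = \frac{1}{2 - n}\left[\prod_{k=1}^{m}\left(1 + \frac{2 - n}{k}\right) - 1\right], \tag{39a}$$

$$q = \frac{1}{3 - n}\left[\prod_{k=1}^{m}\left(1 + \frac{3 - n}{k}\right) - 1\right] - \frac{1}{2 - n}\left[\prod_{k=1}^{m}\left(1 + \frac{2 - n}{k}\right) - 1\right]. \tag{39b}$$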
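The coefficients $p$ and $q$ fixed by the chain of equalities above can be evaluated exactly, including at $n = 2$ where the closed forms become indeterminate. The sketch below is illustrative only and not from the paper; the function names are hypothetical. It writes $f(x) = \prod_{k=1}^{m}(1 + x/k) = 1 + x\,g(x)$ as a polynomial, so that $p = g(2 - n)$ follows from $1 + p(2 - n) = f(2 - n)$, and $q$ follows from $q(3 - n) + p = f(3 - n) - f(2 - n)$.

```python
from fractions import Fraction


def product_poly(m):
    """Coefficients (lowest degree first) of f(x) = prod_{k=1}^{m} (1 + x/k)."""
    coeffs = [Fraction(1)]
    for k in range(1, m + 1):
        new = [Fraction(0)] * (len(coeffs) + 1)
        for i, c in enumerate(coeffs):
            new[i] += c          # contribution of the factor's constant term 1
            new[i + 1] += c / k  # contribution of the factor's x/k term
        coeffs = new
    return coeffs


def evaluate(coeffs, x):
    """Horner evaluation of a polynomial with the given coefficients."""
    result = Fraction(0)
    for c in reversed(coeffs):
        result = result * x + c
    return result


def p_and_q(m, n):
    """Return (p, q) for a term of order n; valid for any n except n = 3."""
    coeffs = product_poly(m)
    g = coeffs[1:]                # f(x) = 1 + x * g(x)
    x = Fraction(2) - n
    p = evaluate(g, x)            # from 1 + p(2 - n) = f(2 - n)
    f2 = evaluate(coeffs, x)      # f(2 - n)
    f3 = evaluate(coeffs, x + 1)  # f(3 - n)
    q = (f3 - f2 - p) / (x + 1)   # from q(3 - n) + p = f(3 - n) - f(2 - n)
    return p, q


m = 3
harmonic = sum(Fraction(1, k) for k in range(1, m + 1))
p2, q2 = p_and_q(m, 2)
print(p2 == harmonic, q2 == m - harmonic)
```

At $n = 2$ this gives $p = 1 + \tfrac12 + \ldots + \tfrac1m$ and $q = m - p$. For $n \ge 4$ with $m \ge n - 2$ both products vanish, and the same routine gives $p = 1/(n-2)$ and $q = 1/[(n-2)(n-3)]$.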


By passing to the limit $n \to 2$ it is easy to show that in this case $p = a = 1 + \tfrac{1}{2} + \ldots + \tfrac{1}{m}$
and $q = m - a$. Therefore


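$$\frac{\partial}{\partial x^\sigma}\left[(a + b\phi + \ldots)T^{\mu\sigma}_{-2)}\right] = \left\{\frac{\partial}{\partial x^\sigma}\left[a\,T^{\mu\sigma}_{(-2)} + (m - a)\,T^{\mu\rho}_{(-2)}\frac{s_\rho}{\kappa}v^\sigma\right]\right\}_{(-3)}, \tag{40}$$

the suffix denoting that only the 3rd order term resulting after differentiation is
to be retained. Now, since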


a 


( 41 ) 


rH I ^ 




it is found that 




±_r Tr y] = y g») (42) 

Sa^L. J /f So: 0, a: |_ K n K n k k K n /c J ’ 

where the various symbols have the same meaning as in (37). Since a (f n ) 

are finite functions of s^Jk and r only, it is obvious that * s of the 


nth order. So that 


_3_ 


By comparing (36) and (42) it is also seen that 

K dx* 1 _ cxP J ( _ n) ’ 

a result which will be used later. So from (33), (40) and (43) 

9 [dT^ v ) 

T"^ = 2% + ~v u + p~g^rj a - 

It should be observed that 


(43) 


(44) 


(45) 


d_ 

dx? 




{ g rpjia 

- x - ~ 2) } is of the 3rd order, and therefore by a relation analogous to (22) 

J(—3) 


, 9 f 8?T f-2)) = 

- (—3 


dT(' 2) 


fa? 1 (—3) * 


dx v \ dx* j(—$) 

r dT ll(T ) 

Now since is conserved with respect to v, as also g^r j 5 ^ follows 
from (45) that 

[y f 5 S) ^] = o. 


dT^s 3 
>+; 


dctf dx v 

Further, using again the relation expressed by (22) 


(47) 


-,m from (47). 

OX? 


(48) 


Employing an argument similar to that used in deriving (35) and (40) it can be 
shown that 

i)Tt% _ dTfr &TfZ * -1 rsyffs)"] 

3 xp dx? L ° xP 1(— 3 ) L dx? J ( _ 3 )' 


( 49 ) 




Therefore from (45), (48) and (49) 

It is easy to see from (15), (16), (33) and (50) that 

T "p* — T^ v + 6 K^ V , 

where 0 K^ V = 0 {cl -{- b<f> 4*...) T^ lv — d{a + b<j> ■+*...) T@ f— 3 ) ~ 


(50) 


= 6 (a + b<f) + ■••) T^L 3 — 6 
We can therefore write 


aTrn—i? . 

^ J 


Kt‘ v = {a + b<f> + ...) Tf_ z -aTfl ^-f if. 


Using (39) one gets explicitly 

J^fw _ [~rppv _ Ji/ur iz v 7\ [" 4. — -1 

A [_' U ( - 3) ^ J ^U-2 (»-2)(»-3)J- 


Thus 




(51) 

(52) . 


and is therefore of the required form. 

Now T'"> u ’k v = TZZz)< v + T X-s) ~ K ' an( i therefore it possesses terms only up to the 
2nd order, thus giving a finite integral over the world tube. 


„ d f s 0 \ d s. d 

Sin “ 

it follows from (36) and (44) that 


1st y\ 8gT E«> p^n 
\ p K J doc? L dxP J(_ re _D 


m*- r , 


(53) 


(54) 


(44) and (54) hold not only for T^.'. n) but also for any term of the nth order which is 
expressed as a Ik 71 , where a is dependent on s“//r and r only. 

In view of (44) the conservation of T"'t w is easy to understand. Because, using (49), 


W" PiSsl + ± V T „ Uf\ 

daf [_ dcf J(_ 3 ) dx v ]_ ( 3) k 


= " 3r f-2) 

L dx p 


! L + ^ ? ^ from<ii) 


- ° from (w> md ( * 9> - 




Precisely in the same way it can be proved that 




(55) 


is conserved with respect to v. Further it can be shown that if w, ^ 2 

s dK^ 

T>» n) + T£ -2 if = Tr + 

K 


dx* 


where is a tensor antisymmetrical in v, cr. This follows from the fact that 


8^ = - T m P’ + TZ n) + T^ n _ i) — v v 

K 


= {*1-^*) T 1~ S)+ T ?-*>+ - + T t-n)+ T(- n -ifa” 


is conserved. It is then obvious that exactly as in the case of T* lv a tensor 

<? P K'mw 

S'" 1 " = + ^ + 


can be formed, where K^ v<x is a suitable tensor antisymmetrical in v, cr. But since 
8 ** does not contain terms of the 2nd order or lower, and 


= vv ~{ s z~~ vP ) n-t )=°> 


dK'v™ 

then S m ^ v = 0, i.e. 8 ^ v — -«-, so that 

dx* 


Tf_ n) +T{Z n L 1) —v v = T' 1 '^- 


dK'fi™ 

dx a 


T^ v + 


d& v<T 
dx* 


Thus it is found that there exist a large number of tensors formed according to the 
simple rule (55) and capable of being put in the form (1) in which the order Tc of the 
highest singularity can be any number such that 3 < Tc < m, 

4. It will now be proved that the 3rd order term in $T'''^{\mu\nu}$ does not give rise to a
singularity in the energy-momentum integral, in the case of the electromagnetic
and meson fields.

It has already been shown that $T'''^{\mu\nu}\kappa_\nu$ has terms only up to the order $1/\kappa^2$, even
though $T'''^{\mu\nu}$ contains terms up to the order $1/\kappa^3$. If equation (4) is rewritten for
$T'''^{\mu\nu}$ then







Since the right side of this equation is finite the left side must also be so. It is,
however, evident from (50) that the left side contains a singular term of the order $1/\kappa$
which is

(56)

Therefore this term must be put equal to zero. Therefore

(57)

where $c^\mu$ are constants. The physical meaning of this is obvious. Because the rate of
outflow of energy and momentum from the particle to the outer space, given by
$\left[\int_\kappa T'''^{\mu\nu}\kappa_\nu\,d\Omega\right]_{\kappa \to 0}$, is finite, any singularity in total field energy or field momentum must
remain independent of $\tau$.*

It is now assumed that for values of $\tau$ very far in the past the world-line is straight
so that the particle (whose motion the world-line represents) was moving a long time
back in a straight line. It is further assumed that at that time it had no rotational
motion. Consider now the following motion of the particle. In the remote past it
was moving uniformly in the absence of an external field. It is then acted upon by
an external field which causes acceleration. Finally, the external field dies down and
the particle has again in the distant future uniform unaccelerated motion but with
a velocity entirely different from the one it had far back in the past. If such a motion
is permissible, then since $c^\mu$ is independent of $\tau$, it must remain the same in the
initial and final stages of motion, i.e. it should be the same whatever the velocity of
the particle in the course of its uniform motion. This means that $c^\mu$ is independent
of the choice of the Lorentz-frame of reference. The only tensors which are so
independent are 0 and $g^{\mu\nu}$. The latter alternative is obviously inadmissible.
Therefore $c^\mu = 0$.


* $\int_{\kappa=\epsilon}^{\kappa=\eta} d\kappa \int_\kappa T^{\mu\sigma}\frac{s_\sigma}{\kappa}\,d\Omega$ is not really the energy and momentum contained inside the volume
enclosed between the two spheres $\kappa = \epsilon$ and $\kappa = \eta$. Energy-momentum is in fact defined as
the integral of $T^{\mu\nu}$ over a plane three-dimensional space-like surface. However, if in equation
(10) of A we take $S$ as the plane defined by $[x^\mu - z^\mu(\tau)]\,v_\mu(\tau) = \epsilon$ (which is nothing but the space of
the rest system taken a time $\epsilon$ later than $\tau$), it is easy to see that $P$ intersects $S$ in the retarded
sphere of radius $\epsilon$. Remembering that for the plane $dS_\nu = v_\nu(\tau)\,dV$, where $dV$ is the element
of three-dimensional volume, relation (10) of A for $T'''^{\mu\nu}$ can be written as

$$\frac{d}{d\tau}\int_S T'''^{\mu\nu}v_\nu\,dV = \int T'''^{\mu\nu}\kappa_\nu\,d\Omega - T'''^{\mu}(Q),$$

where $T'''^{\mu}(Q)$ is the integral over the two-dimensional surface of intersection of $S$ and the
tube $Q$. It can be directly inferred from this that the singularity in $\int T'''^{\mu\nu}v_\nu\,dV$ is independent
of $\tau$.





In the case of the electromagnetic and mesic particles it is usually assumed that
such a motion is possible.* Therefore

$$c^\mu = 0 \tag{58}$$

for the corresponding fields.† Thus the integral of $T'''^{\mu\nu}$ over the light cone

$$\left(\int_{\kappa_1} d\kappa \int_\kappa T'''^{\mu\nu}\frac{s_\nu}{\kappa}\,d\Omega\right)_{\kappa_1 \to 0}$$

is finite. From this it will be proved that the integral of $T'''^{\mu\nu}$ over any finite space-like
surface is finite, irrespective of the manner in which the point of singularity is
approached. One need consider only the 3rd order term $T^{\mu\sigma}_{(-3)}\frac{s_\sigma}{\kappa}v^\nu$, since this is the
only one which can give rise to a singularity in the integral.

Let the space-like surface be 

g°(a?J = constant, (59) 

and let it intersect the world-line at c. Using the notation of A, the integral is 

1 D \~ ld ^ d ^ 3 - ( 6 °) 

Take g 1 == x 1 , g 2 = x 2 , g 3 = x 3 , where x\ x 2 , x 3 are measured in the rest system at c. 
Then D = 9g°/3#°. Now let c be surrounded by a two-dimensional closed surface 
lying in (59) and given by the equation 

gi(^) = constant = e, (61) 

such that as e->0 this surface tends to the point c. It will be assumed that for 
sufficiently small values of e (61) is always entirely enclosed between the two two- 
dimensional surfaces resulting from the intersection of the two tubes 

k = ae, (62) 

k = fie, ( 63 ) 

with (59). Here a and /? are constants (a < /?) independent of e. Now calculate the 
integral (60) taken over the portion of (59) lying between (61) and (63). For points 
lying in this region, it follows from the Taylor expansion of v v that for sufficiently 
small values of e 

| v v _ (v v ) Q | < Ae, 

where (^) 0 is the velocity at the point c and A is a constant independent of e. There¬ 
fore without affecting the singular terms v v can be replaced by (v v ) 0 in (60). Remem¬ 
bering that ( v v ) 0 = (1, 0 , 0 , 0) and D — 3g°/3a? 0 one gets the integral as 

J — dx^dx 2 dx 3 . (64) 


* In fact it may even be necessary to demand that only such motions are permissible
(Dirac 1938).

† This can also be proved directly, from the form of the energy-momentum tensor of the
electromagnetic and the meson fields.





Put 


rp^ *, 

( 3) K K 3 K K 3 ’ 


so that a/ 1 is a finite function of r and s“//c only. Let ] a,> 1 \ m be the maximum value of 
i | in the region of integration of (64). Then 

dx 1 dx 3 dx 3 < f K ~ fie dx 1 dx 3 dx 3 . (65) 
(0C6) J K= =(xe 


J C 1=s e 


/c° 


dx 1 dx 2 dx 3 < 


r 

J?- 


m #a* 


The spatial volume enclosed between $\kappa = \alpha\epsilon$ and $\kappa = \beta\epsilon$ is obviously of the order $\epsilon^3$.
The right side of (65) is thus finite and hence so also must be the left. Now it is seen
from figure 1, where $AB$ is the world-line, $L$ the light cone and $Q$ any finite tube,
that since the integrals of $T'''^{\mu\nu}$ over $P$ and $L$ are non-singular the integral over the
portion of $S$ intercepted between $P$ and $Q$ must also be so. It has just now been
shown that the integral taken over the portion of $S$ enclosed between $P$ and $\xi^1 = \epsilon$
remains finite as $\epsilon \to 0$. Therefore the integral over the portion intercepted between
$\xi^1 = \epsilon$ and $Q$ also remains finite as $\epsilon \to 0$. It has thus been proved that the integral
over $S$ is finite, no matter how the point of singularity be approached.*
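The order-of-magnitude statement used above can be made explicit in an idealized case. The following bound is a sketch under simplifying assumptions that are not in the paper: the region between $\kappa = \alpha\epsilon$ and $\kappa = \beta\epsilon$ is treated as an exact spherical shell in the rest system at $c$ (so that $\kappa$ coincides with the spatial distance), and $|a^\mu|_m$ denotes the maximum of $|a^\mu|$ over that region.

```latex
\left|\int_{\alpha\epsilon \le \kappa \le \beta\epsilon}
   \frac{a^\mu}{\kappa^3}\,dx^1\,dx^2\,dx^3\right|
\;\le\; \frac{|a^\mu|_m}{(\alpha\epsilon)^3}
   \cdot \frac{4\pi}{3}\left(\beta^3 - \alpha^3\right)\epsilon^3
\;=\; \frac{4\pi}{3}\,|a^\mu|_m\,\frac{\beta^3 - \alpha^3}{\alpha^3}.
```

The factors of $\epsilon^3$ cancel, so the bound is independent of $\epsilon$ and the contribution of the shell stays finite as $\epsilon \to 0$. For a curved world-line or a general surface $S$ the shell is only approximately spherical, but its volume remains of order $\epsilon^3$, which is all the argument requires.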


Figure 1


If $T'''^{\mu\nu}$ be integrated over the region of $S$ enclosed between $\xi^1 = \epsilon$ and $\xi^1 = \eta$
($\eta > \epsilon$), then from (52) and equation (20) of A it is seen that


T m rdS p - f T*dS v - b f Kr*dS„- f K>™dS V( 
JS J Jr-e 


where dS vrr is written for 




dx" cx rT 


d?dt? 


( 66 ) 


* It is noteworthy that the energy-momentum integrals though finite, are not unique. Their 
value may depend on the way the point of singularity is approached. The energy is therefore 
indeterminate to this extent. 





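and $K^{\mu\nu\sigma}$ for $K^{\mu\nu}s^\sigma - K^{\mu\sigma}s^\nu$. It has been shown that as $\epsilon \to 0$ the left side remains
finite. Now $K^{\mu\nu\sigma}$ contains certain terms of the 2nd order, namely, those arising from
the 3rd order term $T^{\mu\nu}_{(-3)} - T^{\mu\sigma}_{(-3)}\frac{s_\sigma}{\kappa}v^\nu$ in $K^{\mu\nu}$. These terms can be removed and form
a tensor $\bar{T}^{\mu\nu}$ given by

$$\bar{T}^{\mu\nu} = T^{\mu\nu} + \frac{\partial}{\partial x^\sigma}\left[K'^{\mu\nu}s^\sigma - K'^{\mu\sigma}s^\nu\right], \tag{67}$$

where

$$K'^{\mu\nu} = \sum_{n=4}^{m+2}\left[\frac{T^{\mu\nu}_{(-n)}}{n - 2} + \frac{T^{\mu\sigma}_{(-n)}\frac{s_\sigma}{\kappa}v^\nu}{(n - 2)(n - 3)}\right]. \tag{68}$$

Since only 2nd order terms have been removed from $K^{\mu\nu\sigma}$ it is easily seen from (66)
that $\bar{T}^{\mu\nu}$ also gives a finite integral as $\epsilon \to 0$. But as $K'^{\mu\nu\sigma}$ now contains no term of
an order less than the 3rd its integral over the surface $\xi^1 = \eta$ tends to zero as $\eta \to \infty$,*
so that

$$\int_S \bar{T}^{\mu\nu}\,dS_\nu = \int_S T^{\mu\nu}\,dS_\nu - \int_{\xi^1 = \epsilon} K'^{\mu\nu\sigma}\,dS_{\nu\sigma}, \tag{69}$$

where the three-dimensional integrals are to be taken over the entire space outside
$\xi^1 = \epsilon$. Since the total energy and momentum obtained by making $\epsilon$ tend to zero
remain finite, the inflow must also be finite.

For small values of $\kappa$ when $T^{\mu\nu}$ can be expanded into terms of different orders
with respect to $\kappa$, it can be proved without difficulty that

$$\bar{T}^{\mu\nu} = T^{\mu\nu}_{-3)} + s^\nu\frac{\partial T^{\mu\sigma}_{-3)}}{\partial x^\sigma}, \tag{70}$$

where $T^{\mu\nu}_{-3)}$ is the part of $T^{\mu\nu}$ left over after omitting terms of order higher than $1/\kappa^3$.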

5. All the above results except (58) can be applied to any arbitrary tensor $U^{\alpha\ldots\nu}$
having any number of indices, provided it satisfies a conservation equation
with respect to $\nu$, and can be expanded for small values of $\kappa$ in a series involving
different powers of $\kappa$. The tensor $U'''^{\alpha\ldots\nu}$, and therefore $\bar{U}^{\alpha\ldots\nu}$ derived from $U^{\alpha\ldots\nu}$ by
the above process, may, in general, still contain a singularity of the order $1/\kappa$ in
the integral

$$\int_\kappa \bar{U}^{\alpha\ldots\nu}\frac{s_\nu}{\kappa}\,d\Omega. \tag{71}$$

This singularity, however, must be independent of $\tau$. It is now possible to show
that this singularity can be removed.

It is easily seen by actual calculation that 


f d 

[dx* _ AT 3 


e~ bK 



j a: a : 3 


(1 — &A:log Jik) 



(72) 


* Here it is assumed that for sufficiently large values of $\eta$, $\xi^1 = \eta$ is completely contained in
the space between two spheres of radius $\alpha\eta$ and $\beta\eta$ described about $c$ in the rest system.
($\alpha$ and $\beta$ are constants independent of $\eta$.) The total surface of $\xi^1 = \eta$ is assumed to tend to
infinity as $\eta^2$.



where $b$ and $h$ are positive constants. Therefore


e-^log h^dQ = ^~ (1 — 6* log Kk) e~ bK . (73) 

The singularity in (71) is of the form $c^{\alpha\ldots}/\kappa$, where $c^{\alpha\ldots}$ are certain constants.
Therefore


f U a - v —dQ = bc“-■: logMe-o*, 
J K K 

V.~. , 


where 

Even though (74a) contains a singularity it can easily be seen that ; 


(74a) 

(746) 


P 71 dK [u a -' v ~dQ 
J /c=e J K 


remains finite as e^O. Very much in the same manner as for T m * iv it can be shown 
that the integral of U a -* v over any space-like surface is finite no matter how the 
point of singularity be approached. 

Further on account of $e^{-b\kappa}$ the surface integral of the term inside brackets in
(74b) vanishes on the infinite sphere, so that in an equation like (69) the integral
may still be omitted on the surface at infinity. It is to be noted that $b$ is entirely
arbitrary except for the fact that it is greater than zero.

Now apply these considerations to the case of the angular momentum tensor
defined by

$$M^{\mu\nu\sigma} = x^\mu T^{\nu\sigma} - x^\nu T^{\mu\sigma},$$

which is conserved with respect to $\sigma$, due to symmetry of the original tensor $T^{\mu\nu}$.
One can therefore build $\bar{M}^{\mu\nu\sigma}$ which for small values of $\kappa$ can be written as

_ Ml + s' 7 g— (75) 

$ 

Since x* 1 = zHt)-\ — k it is found that 

K 

= x*T v Z g) - x v T?I Z) +- s v T(* iy (76) 

Therefore, using (57), 

Thus a tensor can be constructed defined by 

0 % 

IW" = ^Qp>,<r S p_Qia>,p^ (11a) 



On the removal of infinite self-energies of point-particles 159 


where 


and 




n—i 


M^ n) 


n- 


+ 


/C 

(n—2) (n— 3). 


cJ n ’ er iK log Iik 

-j-v* - S — 

An k s 


cP v = {zPc v -z v cP)+^{srTll i) Sr-s v T^l i) s <r )dQ, 


(776) 

(77c) 


such that the total angular momentum calculated from it is always finite. 

Now return to the special case of the meson and the electromagnetic fields. 
Precisely as in the case of c (equation (58)) it can be shown that since d LV is anti¬ 
symmetric in ji, v while g^ is symmetric 

c* = 0. (78) 


However, it is important to note that owing to the fact that the meson fields contain
a factor $e^{-\chi\kappa}$ at infinity (Appendix, equation (116a)),* the angular-momentum
integral as calculated from $M^{\mu\nu,\sigma}$ does not contain a singularity at infinity. In the
electromagnetic case, however, a logarithmic singularity would appear at infinity
if $c^{\mu\nu}$ did not vanish. As it is, the total angular momentum
calculated from $M^{\mu\nu,\sigma}_{\mathrm{mod.}}$ is free from singularity both at zero and at infinity.†
All the above considerations are immediately applicable to an assembly of more
than one particle, because near any particle the field due to all others is regular and
can be included in the ingoing field. We have only to take instead of (67) and (68)


fpfiv _ qifiv _j_ ^ 


dx* 


xsr. 


where p refers to a particular particle, 

= K'jrZ-K'jrHi, 


n=m(T/ lv . . 

K'r = s l-*==4 




__ nf 

n =i\n-2~ r (n-2){n-Z) K p p 


(78 a) 

(786) 
» (78c) 


* For large spatial distances from the particle (measured in the present space of any 
Lorentz-frame) the corresponding retarded point goes far back in the past of the particle, so 
that for sufficiently large distances the retarded point lies in the region of the world-line 
corresponding to uniform motion of the particle. The value of the fields at infinity may there¬ 
fore be taken to be the same as in the static case. This is also evident from the consideration 
that since potentials and therefore fields are propagated with a velocity at most equal to that 
of light, the value of these quantities at sufficiently large distances must still be unaffected 
by radiation from the particle. The external field of course is supposed to vanish sufficiently 
fast at infinity. 

† In this connexion it may be mentioned that a misprint has occurred in the expression for
given in the paper by Bhabha & Corben (1941). In equation (114) of their paper, instead
of the term 


it should read 


[-‘4 

[- 2 v 41 - 




Here $s^\mu_p$ and $v^\nu_p$ refer to the particular particle. $T^{\mu\nu}_{(-n)}$ is the $n$th-order term in $T^{\mu\nu}$
when it is expanded in the neighbourhood of the particle $p$ (i.e. for small $\kappa_p$). Similarly,
corresponding to (77)


o 

= m^+ S - 6%’ p sl], 

. p 

(79a) 

m h-« •»<r\ 

* ~^L»-2 + (»-2)(»-3)iv *J 

(796) 


can be written. Here = 0 . 

6. Some special remarks may now be made regarding the symmetrization of the
modified tensor for the electromagnetic dipole. As already observed in § 1 this 
symmetrization is, however, of no particular importance. 

As proved in the appendix (equation (124)) it is found that for the electromagnetic 
case 

T^s v = 0, (80) 

where $T^{\mu\nu\,\mathrm{ret.}}$ has the same meaning as that in the paper of Bhabha & Corben (1941).
Therefore 

T m ^ ret. = y/^ret. f rom (5Q), (81) 

which is obviously symmetrical. Since in the static case $T^{\mu\nu\,\mathrm{ret.}}$ does not contain terms
of order lower than $1/\kappa^4$ (appendix, equation (116b)),* then


■F—Z) static = 0- (82) 

Therefore the integral of $T^{\mu\nu\,\mathrm{ret.}}$ does not contain any singularities at infinity.†
Using the symbols used by Bhabha & Corben (1941), we write

$$T^{\mu\nu} = T^{\mu\nu\,\mathrm{ret.}} + T^{\mu\nu\,\mathrm{mix.}} + T^{\mu\nu\,\mathrm{in.}}. \qquad (83)$$

Because both $T^{\mu\nu\,\mathrm{ret.}}$ and $T^{\mu\nu\,\mathrm{in.}}$ are separately conserved, $T^{\mu\nu\,\mathrm{mix.}}$ must also be
conserved, and the procedure of the preceding pages can be applied to $T^{\mu\nu\,\mathrm{mix.}}$. We
construct

s 

fpwp> mix. -- y/fvmis. | y/vermis. ^ ( 84 ) 

Further K 


* 

+ {z^T\L^ - z v Tff^’) ^ v* + - s v T(£jf*') ^ v cr . (85) 

From (84) and (85) 

Jf = x jtpMv(rm is. _ + 5 /J pvp_ mix.j $<r _ ti v <r\ _j_ y^pmix.fp 

L l ( 3) \ 9 * ) ( 4) K JJ_’ 


( 86 ) 


* The potentials contain terms of order $1/\kappa$ or higher. The fields therefore contain terms of
order $1/\kappa^2$ or higher. Hence $T^{\mu\nu}$ does not contain terms of order lower than $1/\kappa^4$.
† Cf. footnote * on p. 159.




the minus sign at the end denoting the subtraction of terms got by exchange of
$\mu$ and $\nu$ in the expression within the square brackets.

Since the highest singularity in the mixed tensor in the case of an electromagnetic
dipole is of order $1/\kappa^3$, $T^{\mu\nu\,\mathrm{mix.}}_{(-4)} = 0$. Using this fact and the conservation of
$M^{\mu\nu,\sigma\,\mathrm{mix.}}$ and $T^{\mu\nu\,\mathrm{mix.}}$, it is found from (86) that


0 = 




dx* 


—. ywy^mix. rpMftv mix, i ^ ${ipvp mix. (sz— . 

dxf L ( 8) \ p k /__ 


(87) 


Putting 


it is found that 


1 d 

p ,l! pv mix._ & -j- JFh v ] 


( 88 ) 

(89) 


is symmetrical and is conserved (Pauli 1941 ). It is also in the form ( 1 ) as required. 
Further, since is of the order 1/a: 2 the additional term does not introduce any 
singularity in the energy integral as is obvious from ( 66 ). Also, since TfZgf** vanishes 
at infinity (due to the absence of the external field in the remote past),* the integral 
in ( 66 ) over the surface at infinity is zero. 


— 10 

Thus Tf* v — T^ et “ + T m * lv mix ' — -—- [ L* lv ’ a -f If^ v — L V(T 

2 0x°" 


j + 


can be taken as the modified tensor. This tensor is symmetrical. That in the case of
a point charge this symmetry is possible has already been shown by Pryce (1938).
In this case


iji/iv ret. = i! ( 1_ y ') a _..v | /j («g+gg) 

47t L /c 6 ' a: 5 a: 4 


Thus according to (51) 

( o \ r p[*v rpfur « 

Tit s)- TfZ S) f if) + -f i) +XT7^ 


T- 

(90) 


* For small $\kappa$ the retarded potentials $U^\nu$ and the retarded field quantities $G^{\mu\nu}$ can be
expanded in a series of positive and negative powers of $\kappa$, and the ingoing field in a Taylor's
series with positive powers of $\kappa$


^•=(<Vr 0 ) + 



fixPdX*-) K K 


where the suffix z. p (j 0 ) means that the corresponding quantities are to be evaluated at the 

. u.v 

retarded point. Therefore consists of terms of the form where u and v are both 

of order zero in k and u comes from Gfjf ', while v comes from . The factor in v having the 
suffix z p (T 0 ) is to be evaluated at the retarded point. At infinity v — 0 because the corre¬ 
sponding retarded point goes to the remote past and there the field quantities and their 
differentials vanish. 





(In spite of the fact that $K^{\mu\nu}$ is taken instead of $K'^{\mu\nu}$ given by (68), the third term in
(66) vanishes as $\eta \to \infty$ due to (82).*) So that


K** = Kp»$F—K^s v 


=^[d-o J -**)+ s/i 


(V v S a - V°S v ) 1 (g ^-gr^g’Q J 


(91) 


It is interesting to see that this is different from Pryce's tensor which is more
complicated. It is (Pryce 1938)


p 2 r~ 

K^.o- = 

477 L 


,(«W-»V) „ , {g^-g^v”) , x (g^-g^s*) 

i-~-(I ZK ) 


K 5 


4/c 3 


3(^3*-v*tr) 

+ - v + 


4/c 4 

3(v v s <T —v <T s v ) 

4^ 




The modified tensor obtained from both these expressions is symmetric. Our
modified tensor is just equal to $T^{\mu\nu\,\mathrm{ret.}}$ as is proved quite generally in (81).


In conclusion I wish to express my thanks to Professor H. J. Bhabha for valuable
discussions, general guidance and suggested improvements in the writing of this
paper.


Appendix 


Consider now

$$\frac{d}{d\kappa}\int T^{\mu\nu}\kappa_\nu\,\kappa^2\,d\omega, \qquad (92)$$

$\tau$ being kept constant during differentiation. Here $d\omega$ is an element of solid angle
in the rest system and $d\Omega = \kappa^2\,d\omega$. Since


$$\kappa = s_\mu v^\mu = [x_\mu - z_\mu(\tau)]\,v^\mu(\tau),$$

on differentiation keeping $\tau$ constant

$$d\kappa = dx_\mu v^\mu. \qquad (93a)$$

For constant $\tau$

$$s^\mu dx_\mu = 0. \qquad (93b)$$

These are the only two conditions which the infinitesimal vector $dx_\mu$, which connects
a point on the sphere $\kappa$ to a corresponding point on the slightly larger sphere $\kappa + d\kappa$,
need satisfy. The way in which the points on these two spheres may be put in
correspondence is immaterial. Now take $dx_\mu$ along $s_\mu$, thus satisfying (93b) automatically.
(93a) then gives

$$ds_\mu = dx_\mu = d\kappa\,\frac{s_\mu}{\kappa},$$

i.e.

$$\frac{\partial s_\mu}{\partial\kappa} = \frac{s_\mu}{\kappa}. \qquad (94)$$
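The step from (93a) and (93b) to (94) can be checked numerically. The sketch below is only an illustration under stated assumptions: a metric of signature (+, -, -, -), a sample velocity and a sample null vector chosen here for convenience. The names `dot` and `check_radial_step` are hypothetical and do not come from the paper.

```python
# Assumed metric signature (+, -, -, -); chosen for this illustration only.
METRIC = (1.0, -1.0, -1.0, -1.0)


def dot(a, b):
    """Minkowski scalar product a_mu b^mu under the assumed signature."""
    return sum(g * x * y for g, x, y in zip(METRIC, a, b))


def check_radial_step(v, s, eps=1e-6):
    """Displace x along s (dx = eps * s) and report the quantities in (93a)-(94)."""
    kappa = dot(s, v)                      # kappa = s_mu v^mu
    dx = [eps * c for c in s]              # displacement taken along s_mu
    dkappa = dot(dx, v)                    # (93a): d(kappa) = dx_mu v^mu
    s_dot_dx = dot(s, dx)                  # (93b): should vanish because s is null
    # s_mu / kappa should be unchanged by the step, as (94) implies
    shift = max(
        abs((c + d) / (kappa + dkappa) - c / kappa) for c, d in zip(s, dx)
    )
    return kappa, dkappa, s_dot_dx, shift


# Sample data (assumptions): velocity boosted with beta = 0.6, null vector s.
beta = 0.6
gamma = 1.0 / (1.0 - beta ** 2) ** 0.5
v = [gamma, gamma * beta, 0.0, 0.0]
s = [2.0, 1.2, 1.6, 0.0]

kappa, dkappa, s_dot_dx, shift = check_radial_step(v, s)
```

With `dx` taken along `s`, `dkappa / kappa` equals the step size `eps`, which is the content of (94), and `s / kappa` is left unchanged to rounding error.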


* Cf. footnote * on p. 159.

From (94) it is easily found that

$$\frac{\partial}{\partial\kappa}\left(\frac{s_\mu}{\kappa}\right) = 0, \qquad (95)$$

from which it may be concluded that

$$\frac{d}{d\kappa}(d\omega) = 0 \qquad (96)$$

as $d\omega$ is dependent on $s_\mu/\kappa$ only. Thus

d C C 9 dx? C 

— T^k v ,K 2 do) = ^ {T^k p ) K 2 doj + 2 T^K v Kdo). 

~ ~K~K (fr0m e( l Uati0nS ( 12 ) of A = ( 94 ) and ( 95 )) 
4 k \t^ = + K v dQ+j ^ k! dQ 


Since 

then 


. |jL 

Jdxr 


[T^s a - T^s”]. K v dQ, 


(97) 


as 


0 0 rppr o 

__ [Tt“’s' T - T^s"] = 23 T /“'+s«--g—- + (cf. equation ( 66 )) 


ggv* ^ /s \ 

if - = 0 . Since = 0 it is easy to prove further that 


4 J v '<" 


s® s* _ f_3 


-dfi 


J dx* [T ^~- K ^- S K dQ ’ 


(98) 


as quoted in the text. Remembering (94) and (95) it can similarly be proved that 


4 

dK 


so that 


r S “ s A 1 /*r dTi" r ] s* s* 

[T^s,,—...—dQ = - \ 3 T^+s*-^ — s v —... — dQ, 

J K K KJ dx* K K 


(99) 


K 


d [pt‘%-...-dQ = \\2T^+^ d -^+T^'As v -...-dQ 

CUC J K K J | OX°^ K K K 


■J* 


act qA> 

= \6T* v .s v -...-dQ. 

K K 


( 100 ) 


In the same way it is found that 


d_. 

dK 


j \Tr{il-v 9 ir)£..£dQ = \ J[2 T^+se d -^fj(S^-v^...jdQ, (101) 

because Sl~v a v v , which depends on r alone, is to be treated as a constant here. 
Since v <r (S^—v l7 v v ) = 0 then 

K^jT^-v^)^...^dQ = jeT^.W-v^.^dQ. 


( 102 ) 





The most general equation which the potentials U v of a meson field may have 
to satisfy is 

d ~ U’ + xW - 4t(103) 


3a^ 3a: „ 


3# a * ’ ‘ dx x ' 


where $Z^{\alpha\cdots\lambda,\nu}$ is a tensor describing the effect of a multipole on the field. In fact, in
the most general case the right side is a sum of terms containing different numbers
of differentiations, but it will be sufficient to consider one such term since the solution
of this general case is just the sum of the solutions of each of the equations involving
one term only. Since interest arises only in the case of point-particles we can write
(cf. Bhabha 1941, p. 321)

$$Z^{\alpha\cdots\lambda,\nu}(x_\rho) = \int_{-\infty}^{\infty} d\tau\, S^{\alpha\cdots\lambda,\nu}\,\delta(x^0 - z^0)\,\delta(x^1 - z^1)\,\delta(x^2 - z^2)\,\delta(x^3 - z^3). \qquad (104)$$


As is well known the solution for the retarded potentials is 
d d 


TJv ret. __ 

- fax’ 


*■» 

where $u_\rho(\tau) = x_\rho - z_\rho(\tau)$, $u = (u_\rho u^\rho)^{1/2}$ and $\tau_0$ is the retarded time given by $u(\tau_0) = 0$.
Here the first term is just the one which alone appears for the electromagnetic field
($\chi = 0$). It is obviously expressible in a finite series involving terms of different
orders in $\kappa$. The differentiations in the 2nd term act in two ways. First, on the limit
of integration $\tau_0$, and secondly, on the integrand. Bearing in mind that


and 


then 


9 ( JrM \ vu 

03* \ U n ) X X U n + X 

du k __ 

~ g *- a> 


d_ _ 0 _ 
dx v ’ * * 3# A 




can be evaluated in terms of u a and „ J . Also, since 


vr 


and 


u«(t 0 ) = S* 

/ W \ = ( JrM\ = r 

\ / To \ -u n /„_*> 2».«!’ 


(106) 

(107) 

(108) 

(109) 

( 110 ) 


then the value of (108) at the retarded point can be expressed as a finite series in
$s^\alpha$ involving terms of different orders in $\kappa$. It should be noted that

JL f ro JL JL ( h dT 

3af"3a:n u ) 

+f T * &>„*—— JL('hM) dT (ill) 

tcL 03” 0*^ u /Jr. J— 0^0af-0a:H u / ( ' 





Again the first term on the right and its differentials can be written as finite series
in $s^\alpha$ containing terms of different orders in $\kappa$. Therefore the only term in $U^{\nu\,\mathrm{ret.}}$ which
has to be considered is



Sa ...X, V d _ 9 

dx a '"dx x \ u j 


dr. 


( 112 ) 


As shown above this will consist of terms of the type 



,l U n 


(113) 


It is noteworthy that all these terms are non-singular. They can be expanded in a
series containing only terms of positive order in $\kappa$ in the following way. Write
$u_\rho = x_\rho - z_\rho(\tau) = s_\rho - l_\rho$, where $l_\rho = z_\rho(\tau) - z_\rho(\tau_0)$, so that $u^2 = l_\rho l^\rho - 2 l^\rho s_\rho = l^2 - 2(ls)$,
where $l = (l_\rho l^\rho)^{1/2}$ and $(ls) = (l_\rho s^\rho)$. Then

W) m4 * 

u* (VP a - 2 (k)])» ‘ 


Since — is a continuous and differentiable function of l 2 , (114) can be written in 
the form of a Taylor’s series for sufficiently small values of s p 


Jn(xu) Mxi) . s 

u n l* p\ \dl 2 ) l n 

JniXl) , S r JW J n+p ( X l) 
l n p! 


(115) 


It follows from (113) and (115) that the expansion of (105) in a series of $s^\alpha$ is possible.
The $s^\alpha$ are independent of $\tau$ and so can be put outside the integral sign. Since the
integrands involve only $S^{\alpha\cdots\lambda,\nu}$ and $l_\lambda$, which are functions of $\tau$ and $\tau_0$ alone, the integral
is a function of $\tau_0$ alone. It is now possible to separate terms of different orders by
writing $s^\alpha/\kappa$ instead of $s^\alpha$ and multiplying by a suitable power of $\kappa$ to compensate it.
Thus the expansion of $U^{\nu\,\mathrm{ret.}}$ in the required form is obtained. $G^{\mu\nu\,\mathrm{ret.}}$, which involves
one more differentiation, can be expanded in an entirely similar manner. Since the
ingoing field and potentials are always expressible as such a series by Taylor's
expansion, $T^{\mu\nu}$ can be written in the required form with different order terms
separated. This result is used in (9) and (10) in the text.

As is well known, in the static case when $S^{\alpha\cdots\lambda,\nu}$ is constant we get


JJv _ 


JLA JL 

dx i dx k '' ’ dx m 


(?) 


(116a) 


(Here the latin indices run from 1 to 3 only.) In the electromagnetic case, by putting
$\chi = 0$, we get


JJv $ik...m,v 


AJL A 

8x i 8x k dx m 



(1166) 





From (105) it is found that in the electromagnetic case the retarded potentials*
$\phi^\nu$ do not contain terms of order lower than $1/\kappa$. These satisfy the equation


S ^ = 0. 
dx v 

Using the relations (44) and (54), it is found from (117) that 


K ex’ 




and 


dx l 


K CX a 


= 0 for n > 1 


(117) 

(118a) 

(1186) 


Here (— n) denotes the nth. order term. Now calculate 

[F w (- n ) -^(-1) + ^\-n) + -^(“-1)] s p> • (U9) 

where F„„ are the retarded electromagnetic field strengths. From the relation 


dx» dx v 


it is found that 


and 


-[V^L 


n> 1 


( 120 ) 

(121a) 

( 1216 ) 


Using (121b) and (118b) it is found that $F_{\mu\nu(-1)}\,s^\nu = 0$, so that the first term in
(119) does not contribute anything. Now



F ov __ pH cv^o- ,_ *v(fto- S fi «cr\ ^v(-n+l) „cr 

3^ l" k ) to? 3x <r J 



= |V - a f g°>) ^^*g**>] fr °m (U8a), 

(122) 

so that 

F F vp « — F f ^$p{-n+ 1) rtJ o ^^(-71+1)1 

Vd / (-’» s p - 3a* 3a.p j 

(123) 

aS -^ikC-X) 

a" = 0. Also 



^<-.>^4, - 2*?4,[^% s ->+ 



= 2FfV-ig 


— n+1) 


* <j> v is used to denote U vWtm for the electromagnetic field. 




Thus (119) becomes 

^ 1 %“’ - °- 

Thus the expression (119) vanishes. Since 4 s p is just this expression with 
n = 2 , then 

T^s v = 0. (124) 


References 

Bhabha 1939 Proc. Roy. Soc. A, 172, 384-409.
Bhabha 1941 Proc. Roy. Soc. A, 178, 315-350.
Bhabha & Corben 1941 Proc. Roy. Soc. A, 178, 273-314.
Bhabha & Harish-Chandra 1944 Proc. Roy. Soc. A, 183, 134.
Dirac 1938 Proc. Roy. Soc. A, 167, 148-169.
Pauli 1941 Rev. Mod. Phys. 13, 203-232.
Pryce 1938 Proc. Roy. Soc. A, 168, 389-401.


The combination of fibrous proteins with acids 
II. The adsorption of dye anions 

By G. A. Gilbert 

(Communicated by E. K. Rideal , F.R.S.—Received 1 February 1944) 

Equations developed previously for the titration of fibrous proteins with acids are shown to 
accord with extant data for the adsorption of dyes in the presence of salt. The contribution 
of particular groups in the dye molecules to the free energy of adsorption is determined, and 
the influence of the valency of the dye ion is related to the potential on the protein fibre. The 
effect of temperature on the ionic equilibria is calculated, and estimated experimentally for 
a simple dye using a rapid ‘null’ method for the attainment of equilibrium. 


Introduction 

In Part I (Gilbert & Rideal 1944, quoted as (I)) the adsorption of pure acids by
fibrous proteins was examined. It was shown that equal numbers of anions and 
cations were adsorbed by an initially isoelectric fibre, the fibre as a whole becoming 
positively or negatively charged to achieve this equivalence. 

I. Factors influencing the affinity of dyes 

Ions are bound to proteins by electrostatic forces arising from the net charge of 
the protein, and by more specific forces of shorter range involving the immediate 





environment of the ions. With this in mind, consider wool keratin in equilibrium with 
dilute sulphuric acid, sodium sulphate and the sodium salt of a dye acid such as 
Orange 2G. If the dye is fairly simple, it can be stated in broad outline how the 
equilibria will be affected by changes in the relative amounts of the constituents 
(cf. Hallitt 1899, Bancroft 1914, 1915).

Protons and dye anions are strongly adsorbed through their high affinity for 
keratin. If equivalently there are more protons present than dye ions, sulphate ions 
are adsorbed appreciably in spite of their low intrinsic affinity; if there are less 
protons than dye ions, sodium ions are adsorbed. 

Assuming that equilibrium can be reached, the final distribution of ions is
determined by their intrinsic affinities and concentrations. A model was used in (I) to
express this quantitatively, and involved assigning to the fibre an equal number of
positive and negative sites on which anions and cations respectively were randomly
distributed. No allowance was made for the adsorption of more than one ion on the
same site, nor for the adsorption of one kind of ion by more than one type of site.
That there are circumstances when more than one type of site comes into play is
obvious from the titration curves of keratin titrated with weak acids. It is probable
that charged molecules with a very high intrinsic affinity for keratin (even when they
contain such strongly ionized groups as -SO$_3^-$) can be adsorbed by other than
positively charged sites, and then perhaps form sites themselves for the adsorption of
cations. In addition, the more complex the anion adsorbed, the less is it likely to
be legitimate to treat all the positive sites as identical. However, if the simple model
fails, it will be difficult to determine what approximations to apply, for other factors
come into play. Thus molecules as large as dye anions probably find some sites
extremely difficult to reach at temperatures which do not cause fibre decomposition,
and may furthermore undergo aggregation through the very forces that confer
their high affinity.

For the ideal system represented by the model, it was shown in (I) that the
fraction $\theta_i$ of the total sites occupied by an ion of type $i$ and valency $Z$ was
given by (cf. equation (31) (I))

$$RT\ln\frac{\theta_i}{1-\sum_i\theta_i} = RT\ln C_i - \Delta\mu_i - ZF\psi, \qquad (1)$$

if activity coefficients in solution are neglected. $R$ is the gas constant, $T$ the absolute
temperature, $C_i$ the concentration of the ion in the solution, $F$ the Faraday, $\psi$ the
fibre potential, and $\Delta\mu_i$ the intrinsic affinity. (I) should be consulted for the precise
definition of the symbols.
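As a rough illustration of how equation (1) behaves, the sketch below solves it for the site fraction when only one kind of ion is adsorbed, so that the sum over $\theta_i$ reduces to a single term. This is an assumption made for brevity, not the general case treated in (I). The function name `site_fraction`, the SI units and the treatment of the concentration as a dimensionless ratio to its standard state are likewise assumptions.

```python
import math

R = 8.314462618      # gas constant, J mol^-1 K^-1
F = 96485.33212      # Faraday, C mol^-1


def site_fraction(conc, delta_mu, valency, psi, temp_k):
    """Solve equation (1) for theta when a single ion type is adsorbed.

    conc      concentration relative to the standard state (assumed dimensionless)
    delta_mu  intrinsic affinity in J/mol (negative means strongly adsorbed)
    valency   signed valency Z of the ion
    psi       fibre potential in volts
    temp_k    absolute temperature in kelvin
    """
    # theta / (1 - theta) = C * exp(-(delta_mu + Z F psi) / RT)
    x = conc * math.exp(-(delta_mu + valency * F * psi) / (R * temp_k))
    return x / (1.0 + x)


# Illustrative values only: a more negative affinity raises the occupancy.
weak = site_fraction(1e-3, -10_000.0, 1, 0.0, 298.15)
strong = site_fraction(1e-3, -25_000.0, 1, 0.0, 298.15)
```

With zero affinity, zero potential and unit concentration the fraction is one half, which is a convenient sanity check on the signs.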


The term $\dfrac{\theta_i}{1-\sum_i\theta_i}$ plays a similar role in the fibre to that of the concentration
$C_i$ in solution, and is increased by decreasing the adsorption of other kinds of ions
of the same sign, or by reducing $\Delta\mu_i$ or $Z\psi$. This can be illustrated by some
experiments of Speakman & Clegg (1934).





The influence of the valency of a dye ion 

First, find how the potential $\psi$ will vary if sodium sulphate is increased in
concentration in the system wool keratin, H$_2$SO$_4$, Na$_2$SO$_4$, Na$_Z$D (where D is a dye
anion of valency $Z$). To correspond to the conditions of Speakman & Clegg's
experiments, let it be supposed that the sulphuric acid present is insufficient to saturate
the fibre, but is still in fair excess compared with the dye, and let the volume of the
solution per unit weight of fibre be small enough for most of the protons present to
be adsorbed. They will be accompanied by the majority of the dye ions, and also by
sulphate ions almost sufficient in number to neutralize the charge resulting from the
protons and dye ions. When the salt concentration is low adsorbed sodium ions can
be neglected, since under the conditions above the fibre is positively charged, and
the affinity of the sodium ions is reduced to a very low value indeed by electrostatic
repulsion.

Rewriting equation (1) with $Z = 2$ for the sulphate ion, replacing $C_i$ by $SO_4$ (the
external sulphate ion concentration), and allowing for the ion being negative,

$$\psi = \frac{RT}{2F}\ln\frac{\theta_{SO_4}}{1-\sum_i\theta_i} + \frac{\Delta\mu_{SO_4}}{2F} - \frac{RT}{2F}\ln SO_4 \qquad (2)$$

is obtained for the fibre potential. The expansion of $(1-\sum_i\theta_i)$ is $(1-\theta_{SO_4}-\theta_D)$,
where $\theta_D$ is the fraction of anion sites occupied by dye ions. The first term is practically
independent of $SO_4$, since $\theta_{SO_4}$, $\theta_D$ and $\theta_H$ do not vary very much. To a rough degree
of approximation, therefore,

$$\psi \approx \mathrm{constant} - \frac{RT}{2F}\ln SO_4 \qquad (3)$$

for low concentrations of salt, and the potential is controlled by the amount of
sodium sulphate added to the external solution. As the salt concentration is increased,
however, $\theta_{Na}$ becomes more important. This leads to an increase in $\theta_{SO_4}$ which
counteracts the external increase in sulphate-ion concentration, and results in a
progressively slower rate of decrease of potential. Ultimately a limiting potential
would be reached asymptotically if the terms $\Delta\mu_i$ were independent of the salt
concentration. In practice, salting out of the non-polar part of a dye must
occur and $\Delta\mu_i$ decrease, but for present purposes it is sufficient to realize
that the fall in potential is only given by equation (3) at low salt concentrations
(cf. figure (2) (I)).
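To give a feeling for the size of the effect in equation (3), the following sketch evaluates the change in fibre potential produced by a given multiplication of the external sulphate concentration. It assumes the low-salt regime in which (3) holds, SI values for the constants, and a temperature of 100° C for the sample call. The function name `potential_shift` is hypothetical.

```python
import math

R = 8.314462618      # gas constant, J mol^-1 K^-1
F = 96485.33212      # Faraday, C mol^-1


def potential_shift(so4_ratio, temp_k):
    """Change in fibre potential (volts) from equation (3).

    so4_ratio  factor by which the external sulphate concentration is multiplied
    temp_k     absolute temperature in kelvin
    """
    # psi = constant - (RT / 2F) ln SO4, so only the ratio of concentrations matters
    return -(R * temp_k / (2.0 * F)) * math.log(so4_ratio)


# Tenfold increase in sulphate at the boiling-point (373.15 K, an assumed value).
shift_per_decade = potential_shift(10.0, 373.15)
```

Under these assumptions a tenfold increase in sulphate lowers the potential by about 37 mV at the boiling-point, which is the slope that equation (3) predicts before sodium-ion adsorption begins to matter.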

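The change in potential is mirrored by the change in the distribution of the dye ions
in accordance with equation (1). Substituting from equation (2) into equation (1),

$$RT\ln\frac{\theta_i}{C_i} = \tfrac{1}{2}Z\,RT\ln\frac{\theta_{SO_4}}{SO_4} - \left(\Delta\mu_i - \tfrac{1}{2}Z\,\Delta\mu_{SO_4}\right) + \tfrac{1}{2}(2-Z)\,RT\ln\Bigl(1-\sum_i\theta_i\Bigr). \qquad (4)$$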





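If there are $A$ sites per gram of keratin to be shared among sulphate and dye ions,
each ion occupying only one site,

$$\theta_i = \frac{n_i}{A} \quad\text{and}\quad \theta_{SO_4} = \frac{n_{SO_4}}{A},$$

where $n_i$ = number of dye ions adsorbed per gram of keratin, and $n_{SO_4}$ = number of
sulphate ions adsorbed per gram of keratin. Then

$$\ln\frac{n_i}{C_i} = \tfrac{1}{2}Z\ln\frac{n_{SO_4}}{SO_4} - \frac{\Delta\mu_i - \tfrac{1}{2}Z\,\Delta\mu_{SO_4}}{RT} + \tfrac{1}{2}(2-Z)\ln(A - n_i - n_{SO_4}). \qquad (5)$$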



[Figure 1. $\log(n_i/C_i)$ at 100° C plotted against $\log SO_4$ for Dye 5; the theoretical slope is indicated.]

Under the conditions of Speakman & Clegg's experiments, $n_{SO_4}$ and $(A - n_i - n_{SO_4})$
were fairly constant. Increasing quantities of sodium sulphate were added to the
dye solution, kept at boiling-point, and measurement made of the amount of dye
in the solution. From their results $\log(n_i/C_i)$ has been calculated, and this quantity
plotted against $\log SO_4$ in figures 1-6, taking as an approximate measure of the
latter the amount of sodium sulphate added per litre of solution.

It is evident that at low concentrations of salt equation (5) expresses the correct
dependence of the slope of $\log(n_i/C_i)/\log SO_4$ on the valency of the dye ions (the
theoretical slope is indicated in each case), but at higher concentrations departures
occur in the expected direction.
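The theoretical slope referred to here follows from equation (5) once $n_{SO_4}$ and $(A - n_i - n_{SO_4})$ are treated as constant, as the text states they nearly were. The sketch below lumps every constant term of (5) into a single offset (an assumption for illustration) and recovers the slope numerically. The names `log_ratio` and `slope` are hypothetical.

```python
import math


def log_ratio(so4, valency, offset=0.0):
    """log10(n_i / C_i) from equation (5) with all constant terms lumped into offset."""
    # Only the -(Z/2) log SO4 term varies when n_SO4 and (A - n_i - n_SO4) are constant.
    return offset - 0.5 * valency * math.log10(so4)


def slope(valency, so4_low=0.01, so4_high=0.1):
    """Slope of log10(n_i / C_i) against log10(SO4) between two concentrations."""
    dy = log_ratio(so4_high, valency) - log_ratio(so4_low, valency)
    dx = math.log10(so4_high) - math.log10(so4_low)
    return dy / dx


# Theoretical slopes for dye anions of valency 1, 2 and 3.
slopes = {z: slope(z) for z in (1, 2, 3)}
```

The slope is $-Z/2$ whatever the offset, so dyes of equal valency should give parallel lines and only the vertical position carries information about the intrinsic affinity.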








Influence of the constitution of the dye 


The curves allow the influence of a further factor to be seen. Dyes with the same
valency $Z$ are equally affected by changes in the non-specific force due to the fibre
charge. 'Desorption' curves for such dyes are not necessarily superimposed,
however, because each dye has an individual value for its intrinsic affinity $\Delta\mu_i$. In
fact, the vertical displacement of one curve with respect to a parallel curve for
another dye is a direct measure of the difference in $\Delta\mu_i$ for the two dyes. By relating
such differences to the corresponding change in structure, it is possible to find what
increment of chemical potential is conferred by particular elements of structure.
Since the curves for a given valency are parallel, any pair of corresponding points
can be selected to obtain this information. A number of dyes measured by Speakman
& Clegg at only one concentration of sulphate ion can then be included. In table 1







Although the values of the increment are all of the same order, it cannot be said 
from these figures to what extent the increment depends on the molecule as a whole; 
and further work with refined methods of purification, and the elimination of the 
approximations from the terms in equation (5), would be needed for that purpose. 

The substitution of a benzene ring for a methyl group gives a dye molecule of 
greater affinity, as can be seen from table 3. 


Table 3

dyes compared     increment in $\Delta\mu_i$ (kcal.)
17, 1             -0.69
20, 22            -0.66
18, 13            -0.96


Comparison of the average increment here of -0.77 kcal. with the previous
average in table 2 of -1.34 kcal. suggests that a methyl group corresponds to an
increment of -0.57 kcal.
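The arithmetic behind these figures is short enough to reproduce. The sketch below averages the three increments listed in table 3 and subtracts the result from the earlier average of -1.34 kcal. quoted in the text; the variable names are hypothetical.

```python
# Increments in the intrinsic affinity from table 3 (kcal.), benzene ring for methyl.
ring_for_methyl = [-0.69, -0.66, -0.96]

# Average increment for replacing a methyl group by a benzene ring.
mean_ring_for_methyl = sum(ring_for_methyl) / len(ring_for_methyl)

# Earlier average quoted in the text (kcal.).
previous_average = -1.34

# Difference of the two averages, attributed in the text to a methyl group.
methyl_increment = previous_average - mean_ring_for_methyl
```

Rounded to two decimals the two results agree with the -0.77 kcal. and -0.57 kcal. given in the text.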

A different situation arises when an -SO$_3^-$ group is added to a dye ion. The valency
is increased by one, and the net effect on the adsorption of the dye ion is determined
by the sum of the electrostatic contribution $\psi F$, and the change in intrinsic affinity
(i.e. in $\Delta\mu_i$). If the fibre potential is high (i.e. little sodium sulphate is present) the
adsorption tendency of the dye is greatly increased because $\psi F$ is large and negative,
but when much salt is present, the increase in the hydrophilic character of the dye,
through the masking of a benzene ring, is no longer outweighed by the electrostatic
term. The adsorption of the more highly charged dye ion is therefore less in strong
salt solution than that of the less charged ion, in contrast to the position in dilute
salt solution.

The decrease in $-\Delta\mu_i$ per -SO$_3^-$ group seems to get less as the number of -SO$_3^-$
groups increases, but the evidence is too scanty to formulate rules as yet. The three




comparisons given in table 4, however, made from measurements by Speakman &
Clegg as before in solutions containing 130 g. of Na$_2$SO$_4\cdot$10H$_2$O per litre, show that
the introduction of the second sulpho group reduces the intrinsic affinity (assuming
$\psi$ to be small) by almost the amount contributed by a ring.


Table 4

dyes compared     increment ($\Delta\mu_i + \psi F$) (kcal.)
1, 8              1.03
13, 14            0.86
2,                0.96

[Structural formulae of the dyes compared.]


II. The temperature coefficient of the ionic equilibria 

The relations above hold for the particular temperature of the experiments.
Frequently the value of $\Delta\mu_i$ for an ion is strongly temperature dependent, and the
various equilibria are considerably altered between the temperature limits of 0 and
100° C (Wyman 1939; Steinhardt, Fugitt & Harris 1940). Some care is necessary as
before in distinguishing between dissolved and fibrous proteins. In the adsorption
of a single pure acid by a dissolved protein, it is known that if the anion has very
little affinity for protein, hydrogen ions are adsorbed in the acid region of the
titration curve practically without accompanying anions, and against an increasing
positive potential. The differential heat of adsorption of the protons should therefore
decrease with increasing saturation of the fibre. However, one would expect the
magnitude of this effect to be small, from the very fact that the potential is not great
enough to ensure the adsorption of anions.

Fibrous proteins, on the other hand, cannot adsorb protons without accompanying 
anions, and the sum of the heats of adsorption of protons and anions is obtained, 
instead of the heat of adsorption of the protons alone. The fibre potential does not 
affect this sum, as its contribution is equal and opposite for the cations and anions. 




It should be noticed that this potential must vary with temperature, since it
depends on the values of $\Delta\mu_i$ for the various ions.

In deriving heats of adsorption from the values of the chemical potential in the
two phases, solution and fibre, it is simplest to consider first the adsorption of a
single acid with an anion of type $i$ and valency $Z$.

An intrinsic partial molar heat of adsorption $\Delta H_i$ can be defined for each ion which
will correspond to the intrinsic affinity $\Delta\mu_i$, and which will be the part of the heat
which does not arise from the fibre potential. The fact that the temperature coefficient
of the chemical potential throughout the system must be one-valued can then be
taken into account, if the chemical potential of the ion is to remain unique, and
expressed for an anion by

$$\left[\frac{\partial(\mu_i/T)}{\partial(1/T)}\right]_{\mathrm{fibre}} = \left[\frac{\partial(\mu_i/T)}{\partial(1/T)}\right]_{\mathrm{soln.}}. \qquad (6)$$

Since, by the definition above,

$$\frac{\partial}{\partial(1/T)}\left(\frac{\Delta\mu_i}{T}\right) = \Delta H_i, \qquad (7)$$

it follows that if $\theta_i$ is kept constant,

$$\Delta H_i - \frac{\partial(ZF\psi/T)}{\partial(1/T)} = R\,\frac{\partial\ln C_i}{\partial(1/T)}. \qquad (8)$$

The analogous equation for the proton is

$$\Delta H_H + \frac{\partial(F\psi/T)}{\partial(1/T)} = R\,\frac{\partial\ln H}{\partial(1/T)}. \qquad (9)$$

Eliminating $\partial(F\psi/T)/\partial(1/T)$,

$$\Delta H_i + Z\,\Delta H_H = \Delta H_{\mathrm{acid}} = R\,\frac{\partial}{\partial(1/T)}\{\ln C_i + Z\ln H\}, \qquad (10)$$

where $\Delta H_{\mathrm{acid}}$ stands for the heat of adsorption of a gram-molecule of acid. When
$H = ZC_i$, as in solutions of pure acid,

$$\Delta H_{\mathrm{acid}} = (Z+1)\,R\,\frac{\partial}{\partial(1/T)}(\ln H). \qquad (11)$$

On the other hand, if $C_i$ is kept constant by the addition of the sodium salt of the
acid, and if sodium ions are not appreciably adsorbed,

$$\Delta H_{\mathrm{acid}} = ZR\,\frac{\partial}{\partial(1/T)}(\ln H). \qquad (12)$$

It can therefore be predicted that for univalent anions at a low but constant ionic
strength, the change of pH with temperature for a given fibre content of acid is twice
as great as it would be for solutions of pure acid, because the whole brunt of correcting
for changes in $\Delta\mu_i$ and $\Delta\mu_H$ then falls on the hydrogen ions. Direct experimental
evidence for this is lacking because the temperature coefficient is very small for the
'acid' titration curve, but it follows automatically if titration curves such as those
for HCl, KCl mixtures in figure ((1), I) hold at different temperatures. Returning
to the example presented by dyeing, it is seen that since the important ratio $n_i/C_i$
is determined by the difference term $(\Delta\mu_i - \tfrac{1}{2}Z\,\Delta\mu_{SO_4})$, the temperature coefficient
of $n_i/C_i$ is determined largely by the corresponding term $(\Delta H_i - \tfrac{1}{2}Z\,\Delta H_{SO_4})$ or
equivalently by $(\Delta H_{H_Z i} - \tfrac{1}{2}Z\,\Delta H_{H_2SO_4})$, where $\Delta H_{H_Z i}$ is the partial molar heat of
adsorption of the dye acid H$_Z i$, and $\Delta H_{H_2SO_4}$ that of sulphuric acid.
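The factor of two claimed for univalent anions can be read off by comparing equations (11) and (12). The sketch below makes that comparison explicit for a given heat of adsorption of the acid. The function name `dlnH_dinvT` and the sample heat of 10 kJ/mol are assumptions introduced purely for illustration.

```python
R = 8.314462618  # gas constant, J mol^-1 K^-1


def dlnH_dinvT(delta_h_acid, valency, constant_anion):
    """Temperature coefficient d(ln H)/d(1/T) at fixed fibre content of acid.

    delta_h_acid    heat of adsorption of a gram-molecule of acid, J/mol
    valency         valency Z of the anion
    constant_anion  True if C_i is held constant by added sodium salt, eq. (12);
                    False for solutions of pure acid, eq. (11)
    """
    factor = valency if constant_anion else valency + 1
    return delta_h_acid / (factor * R)


# Hypothetical heat of adsorption, chosen only to show the ratio.
pure_acid = dlnH_dinvT(10_000.0, 1, constant_anion=False)
with_salt = dlnH_dinvT(10_000.0, 1, constant_anion=True)
ratio = with_salt / pure_acid
```

For a univalent anion the ratio is 2 regardless of the value chosen for the heat, and in general it is $(Z+1)/Z$.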

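When the experimental conditions are adjusted so that $(A - n_i - n_{SO_4})$ is constant,
differentiation of equation (5) gives

$$R\,\frac{\partial}{\partial(1/T)}\ln\left\{\frac{n_i}{C_i}\left[\frac{SO_4}{n_{SO_4}}\right]^{\frac{1}{2}Z}\right\} = -\frac{\partial}{\partial(1/T)}\left[\frac{\Delta\mu_i - \tfrac{1}{2}Z\,\Delta\mu_{SO_4}}{T}\right] = -\left(\Delta H_i - \tfrac{1}{2}Z\,\Delta H_{SO_4}\right). \qquad (13)$$

With properly controlled conditions, it is therefore possible to obtain the heat of
replacement of a dye ion by an equivalent number of sulphate ions.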
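In practice the derivative in the relation above would be estimated from measurements at a few temperatures. The sketch below uses a two-point finite difference of $\ln\{(n_i/C_i)[SO_4/n_{SO_4}]^{Z/2}\}$ against $1/T$, following the sign convention obtained by differentiating equation (5) at constant $(A - n_i - n_{SO_4})$. The function names and the synthetic numbers are hypothetical and are not data from the paper.

```python
import math

R = 8.314462618  # gas constant, J mol^-1 K^-1


def exchange_term(n_i, c_i, so4, n_so4, valency):
    """The quantity (n_i / C_i) * (SO4 / n_SO4)^(Z/2) whose temperature coefficient is needed."""
    return (n_i / c_i) * (so4 / n_so4) ** (0.5 * valency)


def heat_of_replacement(term_1, temp_1, term_2, temp_2):
    """Estimate (dH_i - Z/2 dH_SO4) in J/mol from values of the term at two temperatures."""
    slope = (math.log(term_2) - math.log(term_1)) / (1.0 / temp_2 - 1.0 / temp_1)
    return -R * slope


# Synthetic check: build the term from an assumed heat and recover it again.
assumed_heat = -20_000.0                     # J/mol, hypothetical
t1, t2 = 313.15, 353.15                      # kelvin, hypothetical
k1 = math.exp(-assumed_heat / (R * t1))
k2 = math.exp(-assumed_heat / (R * t2))
recovered = heat_of_replacement(k1, t1, k2, t2)
```

A two-point estimate assumes the heat is constant over the interval; with more temperatures a straight-line fit of the logarithm against $1/T$ would be the natural extension.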

Equation (13) was tested experimentally for a relatively simple case. A pure 
sample of the non-aggregating dye below was provided by the research department 
of Messrs I.C.I. (Dyestuffs Group) Ltd., and used to dye a specimen of fine quality 
wool very kindly given by Professor J. B. Speakman. 


[Structural formula of the dye.]

In such experiments the temperature coefficient of the term $\dfrac{n_i}{C_i}\left[\dfrac{C_{SO_4}}{n_{SO_4}}\right]^{Z/2}$ is required


over as wide a range as possible. The problem of getting equilibrium at high temperatures
must therefore be faced, in spite of decomposition of the keratin, and at
low temperatures in spite of the slow diffusion of the dye ions. Some appreciation
of the latter difficulty can be obtained from the study made by Speakman & Smith
(1936) of the rate of adsorption by various types of keratin of Orange 2G (an isomer
of the dye used here). These authors determined an apparent relative value of the
diffusion coefficient $k$ of this dye in terms of the initial value of $Q/\sqrt{t}$, where $Q$ is the
amount of dye adsorbed in time $t$. By analogy with diffusion into a semi-infinite
slab, this ratio is at first constant and proportional to $\sqrt{k}$, provided the surface concentration
of dye remains constant. The latter condition appeared to be moderately
satisfied. The values of $Q/\sqrt{t}$ obtained by these authors have been plotted against the
reciprocal of the absolute temperature in figure 7. It is apparent that the diffusion
of dye ions through keratin is an activated process, and one, moreover, requiring
a high energy of activation. In the case of human hair this energy is of the order of
28 kcal., and for mohair about 23 kcal. A many-thousandfold decrease in the diffusion
coefficient therefore occurs if the temperature is lowered from 100 to 0° C.
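
The size of this decrease can be checked with a short calculation. The sketch below assumes a simple Arrhenius law, with the diffusion coefficient proportional to exp(−E/RT), and uses the activation energies quoted above. The function name `arrhenius_ratio` and the value R = 1.987 cal/(mol deg) are illustrative assumptions, not part of the original paper.

```python
import math

R_CAL = 1.987  # gas constant in cal/(mol deg); assumed value


def arrhenius_ratio(e_act_cal, t_hot_c, t_cold_c):
    """Ratio k(t_hot)/k(t_cold) for k proportional to exp(-E/RT)."""
    t_hot = t_hot_c + 273.15
    t_cold = t_cold_c + 273.15
    return math.exp(e_act_cal / R_CAL * (1.0 / t_cold - 1.0 / t_hot))


# Activation energies quoted in the text (cal per mole)
for label, e_act in (("mohair", 23_000.0), ("human hair", 28_000.0)):
    ratio = arrhenius_ratio(e_act, 100.0, 0.0)
    print(f"{label}: k(100 C)/k(0 C) is about {ratio:.2e}")
```

Under these assumptions both ratios come out well above 10^4, which agrees with the many-thousandfold decrease stated in the text.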





The apparent decrease in the energy of activation at high temperatures in the 
presence of salt is due to the increasing unsaturation of the surface. The number of 
dye ions rather than their activity appears to be the relevant factor, and when the 
surface is unsaturated this number becomes temperature dependent. In N/10 Na$_2$SO$_4$
solution, the rate of dyeing does not increase above 60° C, probably because the rate
of decrease of the number of dye ions at the surface (determined by a heat of exchange
with sulphate ions of about 10 kcal.) just balances the increase in the diffusion
coefficient ($Q/\sqrt{t}$ varying in accordance with 23/2 kcal.).


Figure 7. Data from Speakman & Smith (1936); abscissa $T^{-1} \times 10^{3}$ (° K). (a) mohair: pH 1.11; (b) mohair: pH 2.04;
(c) mohair: pH 2.14, N/50 Na$_2$SO$_4$; (d) mohair: pH 2.39, N/10 Na$_2$SO$_4$; (e) human hair: pH 2.04.



The value of $\dfrac{n_i}{C_i}\left[\dfrac{C_{SO_4}}{n_{SO_4}}\right]^{Z/2}$ varies with the origin and history of the wool fibre. It is
therefore imperative to use the same sample of dyed wool for measurements of the
temperature coefficient. A stock sample containing a known amount of dye acid
and sulphuric acid can be prepared by dyeing at a fairly high temperature (e.g. 90° C)
in the presence of salt, filtering at this temperature, centrifuging, and washing at
0° C with small quantities of water to remove decomposition products and salt.
After centrifuging, brief warming restores complete equilibrium. Washing a part
of this sample at 0° C with a small quantity of salt solution of a chosen strength,
and centrifuging, then prepares a specimen for work in the given salt solution.




If now the temperature of such a treated sample is changed, the very small amount
of residual solution immediately comes into new equilibrium owing to the minute
amounts of dye and acid involved. This new equilibrium can be found experimentally
by putting the fibre into a minimal amount of a solution previously adjusted to
approximately the required concentration of dye, salt and acid. Final equilibrium
is reached almost immediately, even at 0° C. The approximately adjusted solution
is conveniently prepared by adding samples of the same dyed fibre at the given 
temperature to a solution containing the correct concentration of salt and then 
replacing them by the final sample. Indeed, one often finds, if the volume of solution 
relative to the fibre is kept small, that a single sample added to a salt solution of 
approximately the correct pH, will give within the time required for the temperature 
to become uniform, a concentration of dye that is unchanged by the addition of 
further samples. An alternative method, but one often not as convenient, is to flow 
salt solution through a column of dyed wool, when the upper part of the column 
plays the part of the preliminary samples. 



Figure 8. Micro-adsorption cell.





An adsorption cell which was found suitable for equilibrating solution and fibre 
is drawn in figure 8. It was immersed in a thermostat, and arranged so that about 
3 ml. of solution could be drawn back and forth through a tightly packed mat of 
fibre (ca. 0.5 g.). Removal of solution for analysis was carried out by increasing the
pressure on the left-hand side (see figure) after removing the membrane over the 
fine capillary which leads from the arm containing the wool. The dye content of 
the solution was estimated polarographically by comparing the reduction wave 
height with that of standard solutions of dye. 

Some measurements obtained by this null method, which avoids decomposition 
at high temperatures and lack of equilibrium at low, besides keeping the fibre 
content constant, are given in table 5. These results are presented graphically in
figure 9. They are only exploratory, and do not possess the accuracy that could be 

Table 5

H+ per gram of fibre (dry weight) = 4.05 × 10^-4 g.equiv.
SO4 per gram of fibre (dry weight) = 2.79 × 10^-4 g.equiv.
Dye per gram of fibre (dry weight) = 1.26 × 10^-4 g.equiv.

temp.      1/T × 10^3    dye concentration                   sulphate concentration
T (° C)    (° K)         C_i (× 10^-6 M)    log C_i          SO4 (M)

90         2.753           3.5              \bar{6}.545      3.7 × 10^-1

30.3       3.295           4.36             \bar{6}.640      0.5 × 10^-[?]
45.2       3.141           9.50             \bar{6}.978
58.8       3.012          17.2              \bar{5}.236
71.7       2.901          31.2              \bar{5}.494
88.5       2.765          66.1              \bar{5}.820

0          3.660           8.7              \bar{6}.940      0.5 × 10^-1
15.6       3.462          15.5              \bar{5}.190
30.6       3.291          33.0              \bar{5}.519
44         3.153          63.2              \bar{5}.801
58         3.019         107                \bar{4}.029
70.5       2.909         190                \bar{4}.279
90.4       2.750         404                \bar{4}.606

0          3.660           7.1              \bar{6}.851      0.5 × 10^-1
0          3.660           7.85             \bar{6}.895
53.1       3.064          97                \bar{5}.987
87.2       2.775         401                \bar{4}.603



attained by the methods outlined. However, the interesting result emerges that
there is a large entropy of exchange. In terms of the partial molar heats and
entropies of adsorption,

$$(\Delta\mu_i - \Delta\mu_{SO_4}) = (\Delta H_i - \Delta H_{SO_4}) - T(\Delta S_i - \Delta S_{SO_4}).$$

From equation (5), since $Z = 2$,

$$(\Delta\mu_i - \Delta\mu_{SO_4}) = RT \ln\frac{C_i\,n_{SO_4}}{n_i\,C_{SO_4}},$$

from which it is found that $(\Delta\mu_i - \Delta\mu_{SO_4})$ at 90° C is approximately −3000 cal.,
whereas the above results show $(\Delta H_i - \Delta H_{SO_4})$ to be approximately −10,000 cal.




Hence $(\Delta S_i - \Delta S_{SO_4}) = -7000/363 = -19$ cal./degree. Much greater restriction seems
therefore to apply to the large heteropolar dye molecule within the fibre than to the 
smaller sulphate ions. 
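
The arithmetic behind these figures can be retraced in a few lines. The sketch below is illustrative only. It takes the fibre contents of table 5 and the 90.4° C reading of the series in 0.5 × 10^-1 M sulphate, and it assumes the Z = 2 form in which the free-energy difference equals RT ln[C_i n_SO4/(n_i C_SO4)]. It also assumes R = 1.987 cal/(mol deg) and uses the rounded value of −10,000 cal for the heat term. The variable names are not the author's.

```python
import math

R_CAL = 1.987  # cal/(mol deg); assumed value of the gas constant

# Fibre contents from table 5 (g.equiv. per gram of dry fibre)
n_dye = 1.26e-4
n_so4 = 2.79e-4

# One reading from table 5: 90.4 deg C, dye 404e-6 M, sulphate 0.5e-1 M
temp_k = 90.4 + 273.15
c_dye = 404e-6
c_so4 = 0.5e-1

# Assumed Z = 2 form of the free-energy difference (cal)
delta_mu = R_CAL * temp_k * math.log(c_dye * n_so4 / (n_dye * c_so4))

# Heat of exchange quoted in the text (cal), rounded
delta_h = -10_000.0

# Entropy of exchange from delta_mu = delta_h - T * delta_s
delta_s = (delta_h - delta_mu) / temp_k

print(f"free-energy difference is about {delta_mu:.0f} cal")
print(f"entropy of exchange is about {delta_s:.1f} cal/degree")
```

With these inputs the free-energy difference comes out near −3000 cal and the entropy of exchange near −19 cal./degree, in line with the values quoted.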

The author gratefully acknowledges continued and valuable discussion with 
Professor E. K. Rideal, and members of his Laboratory. He is indebted to Messrs 
I.C.I. (Dyestuffs) Ltd. for a grant which enabled this investigation to be made. 


References 

Bancroft, W. D. 1914 J. Phys. Chem. 18, 1, 118, 385.

Bancroft, W. D. 1915 J. Phys. Chem. 19, 50, 145.

Gilbert, G. A. & Rideal, E. K. 1944 Proc. Roy. Soc. A, 182, 335.

Hallitt, A. W. 1899 J. Soc. Dy. Col., Bradford, 15, 30.

Speakman, J. B. & Clegg, G. 1934 J. Soc. Dy. Col., Bradford, 50, 348.

Speakman, J. B. & Smith, S. G. 1936 J. Soc. Dy. Col., Bradford, 52, 121.

Steinhardt, J., Fugitt, C. H. & Harris, M. 1940 Bur. Stand. J. Res., Wash., 25, 519.

Wyman, J. Jr. 1939 J. Biol. Chem. 127, 1.


The transition effect of penetrating showers 

By L. Janossy and G. D. Rochester 
The Physical Laboratories, The University, Manchester 

(Communicated by P. M. S. Blackett, F.R.S.—Received 2 March 1944)

Observations of the transition effect of penetrating showers are reported and the effect is
found to agree with the theory of Hamilton, Heitler & Peng as extended by Janossy. The
Z-dependence of the transition effect reported previously by Janossy has not been confirmed.

Introduction 

For the interpretation of penetrating showers it is important to know the details 
of the transition effect accurately. We have therefore made measurements on the 
transition effect with a view to checking and improving upon previous results. The 
effect for lead was measured in some detail, but because of shortage of materials 
only one point was obtained for aluminium. 


The experimental arrangement and the results 

The experimental arrangement is illustrated diagrammatically in figures 1 a, b. 
It consists of three counter trays separated by layers of lead 15 cm. in thickness. 
As the arrangement is very similar to those used by Janossy ( 1942 ) and by Janossy 




& Rochester (1943), a detailed description will not be given. The main difference
from the arrangement used by Janossy & Rochester is that the top tray of counters
in the present arrangement contains eight counters instead of fifteen. The rate of
triple coincidences $B_1 B_2 B_3$ is much reduced with eight counters, and therefore the
rates of casual coincidences and triple knock-on showers are also much reduced
without a corresponding decrease in the number of penetrating showers. It was
found that the total rate of casual coincidences plus triple knock-on showers
amounted to less than 5 % of the sevenfold coincidence rate.



Figure 1 (a), (b)

Absorbers 46 × 46 cm.$^2$ in area were brought close above tray B, and the rate of
sevenfold coincidences as a function of the thickness of the absorber was determined.
The area of the absorber was the same as in the previous arrangements.

Readings for all absorbers, taken over a period extending from December 1942
to November 1943, are given in table 1. They have all been reduced to a standard
barometric pressure of 76 cm. by using the barometer coefficient (−11.7 % per
cm. Hg), obtained by Janossy & Rochester (1944). The results are plotted in figure 2.









The results given in the table or in any of the diagrams have not been corrected 
for the rate of accidental coincidences or knock-on showers, since in studying the 
transition effect we are mainly interested in the change of the shower rate and not 
in the actual rate. The variation in the rate of accidental coincidences plus knock-on
showers is estimated to be less than one-third of the actual accidental rate, i.e. less
than 2 %.



Table 1

                  thickness T                   time       rate
material        cm.       g./cm.^2    counts    hr.        counts per hour

light roof      0.00        0.00       216     1133.8      0.187 ± 0.013
lead            0.86        9.72        68      308.5      0.217 ± 0.026
                1.79       20.25       309      909.5      0.340 ± 0.019
                5.00       56.50       285      768.2      0.361 ± 0.021
               10.00      113.00       339      763.6      0.443 ± 0.024
               20.00      226.00       320      698.5      0.477 ± 0.026
aluminium       7.50       20.25       208      626.0      0.315 ± 0.022



Figure 2. Transition curve observed (J. & R.) and calculated.


Discussion

1. All points on the lead transition curve except the one for 1.8 cm. Pb can be
fitted to an exponential function of the form

$$R = 0.477 - 0.29 \exp(-x/4.6), \qquad (1)$$

where $x$ is the thickness of the absorber in cm. of lead. Thus, disregarding the point
$x = 1.8$ cm., the transition effect can be accounted for in terms of a radiation of






about 5 cm. range. This is exactly what is expected from the theory of Hamilton,
Heitler & Peng (1943) and the theory put forward by Janossy (1943).

Equation (1) is plotted as a broken line in figure 2. It is seen that the deviation
from the broken line of the point corresponding to 1.8 cm. is equal to three times the
standard error of this point, and therefore it is possible that the deviation is not of
statistical origin. The points are represented somewhat better by equation (2),
which is plotted as a full line in figure 2:

$$R = 0.46 - 0.20 \exp(-x/4.6) - 0.09 \exp(-x). \qquad (2)$$

Two interpretations can be offered for the deviation from the exponential function 
represented by ( 1 ), provided the deviation is taken to be real: (a) One can assume 
that only part of the penetrating showers is produced by primaries of range 5 cm. 
lead, the rest being produced by radiation of range approximately 1 cm. of lead. 
This second radiation might well consist of photons. ( b ) One can assume that some 
of the penetrating showers are accompanied by cascade showers. Because of the 
ordinary transition effect of cascade showers in lead a penetrating shower will have 
a larger chance of discharging the top tray of counters when it is covered with 2 cm. 
lead than in the case of an uncovered tray. The second interpretation would account 
better for the observed deviation. 
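
The size of the deviation at 1.8 cm. can be checked directly against the tabulated rates. The sketch below evaluates equation (1) at each lead thickness of table 1 and expresses the difference from the observed rate in units of the quoted standard error. The names `LEAD_POINTS`, `rate_eq1` and `deviations_in_sigma` are illustrative choices and do not appear in the paper.

```python
import math

# Lead points from table 1: thickness (cm.), rate and standard error (counts per hour)
LEAD_POINTS = [
    (0.00, 0.187, 0.013),  # light roof only
    (0.86, 0.217, 0.026),
    (1.79, 0.340, 0.019),
    (5.00, 0.361, 0.021),
    (10.00, 0.443, 0.024),
    (20.00, 0.477, 0.026),
]


def rate_eq1(x_cm):
    """Single-exponential fit, equation (1)."""
    return 0.477 - 0.29 * math.exp(-x_cm / 4.6)


def deviations_in_sigma(points, model):
    """(observed - model) / standard error for each point."""
    return [(x, (rate - model(x)) / err) for x, rate, err in points]


for x, dev in deviations_in_sigma(LEAD_POINTS, rate_eq1):
    print(f"x = {x:5.2f} cm.: deviation = {dev:+.1f} standard errors")
```

Only the point at 1.79 cm. departs from the curve by about three standard errors; the remaining points lie within one standard error of equation (1).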

2. In figure 3 the experimental points as observed by Janossy (1942) are plotted
together with the curve given by (2). The general shapes of the lead transition curves
found by Janossy and by us are very nearly the same even though the two counter
arrangements are different. This shows that the same phenomenon is being observed
with the two arrangements. The maximum change in rate between T = 0 and
T = 10 cm. is for Janossy's arrangement (1942) 0.28 c.p.hr., for the arrangement of
Janossy & Rochester (1943) 0.25 c.p.hr., and for the present arrangement 0.26 c.p.hr.
These rates must be considered the same when allowance is made for normal
fluctuations.

Figure 3. Abscissa: thickness of absorber in g. per cm.$^2$




3. The one reading obtained for aluminium does not agree with Janossy’s value 
for this particular thickness of aluminium. We have made a careful study of this 
point for both lead and aluminium and have not been able to find a difference between 
the rates for the two materials. Throughout the whole of this experiment readings 
were staggered so as to eliminate slow changes in the apparatus, e.g. in the efficiencies 
of the counters, or seasonal changes in the radiation producing the showers. We 
conclude, therefore, that greater weight must be given to this result than to the 
former. 

It is not known why the results of the two experiments differ, but it is unlikely 
that the difference can be due to different counter arrangements. Examination of 
figure 3 will show the previous deduction, that there is a difference between the 
rates for lead and aluminium, to be based chiefly on the points a and b. Because
of statistical fluctuation the other points might well lie on the curve. The difference 
found by Janossy might be due to an unusually large fluctuation in the point a. 


References 

Hamilton, J., Heitler, W. & Peng, H. W. 1943 Phys. Rev. 64, 78.
Janossy, L. 1942 Proc. Roy. Soc. A, 179, 361.
Janossy, L. 1943 Phys. Rev. 64, 345.
Janossy, L. & Rochester, G. D. 1943 Proc. Roy. Soc. A, 182, 180.
Janossy, L. & Rochester, G. D. 1944 Proc. Roy. Soc. A, 183, 186.



The barometer effect of penetrating showers 


By L. Janossy and G. D. Rochester
The Physical Laboratories, The University, Manchester

(Communicated by P. M. S. Blackett, F.R.S.—Received 2 March 1944)

The barometer coefficient of penetrating showers is evaluated from an extended series of
observations and is found to be

B = −11.7 ± 2.7 % per cm. Hg.

The significance of the correlation coefficient is discussed briefly. 


1 . Introduction 

In the course of experiments on penetrating showers we had occasion to note the 
intensities of penetrating showers and the barometric pressures for a period of 
16 months. Barometer correlation was not the main purpose of the experiments 
and therefore a number of different arrangements were used. 

Though the method of correlating barometric pressure with the cosmic-ray
intensity is well known, we sketch the procedure briefly in order to draw attention
to certain points. 


2 . Calculation of the barometer coefficient 

Denote by $C_i$ the number of coincidences recorded in the $i$th reading. Denote
similarly by $t_i$ the duration of the $i$th reading and by $b_i$ the average barometric
pressure during this period. Introduce further the deviations of the individual
values from the average according to the following formulae:

$$\gamma_i = C_i - t_i \sum_k C_k \Big/ \sum_k t_k, \qquad \beta_i = b_i - \sum_k b_k t_k \Big/ \sum_k t_k. \qquad (1)$$

The barometer coefficient is obtained as

$$B = \sum_k \gamma_k \beta_k \Big/ \sum_k \beta_k^2 t_k, \qquad (2)$$

or, if expressed in percentage change per cm. of Hg,

$$\frac{B \cdot 100}{76}\,\frac{1}{\sum_k C_k / n}, \qquad (3)$$

where $n$ is the number of readings.
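
Equations (1) and (2) amount to a weighted least-squares slope of counting rate against pressure. The following sketch applies them to a small set of made-up readings. The function name `barometer_coefficient`, the sample numbers, and the conversion to a percentage by dividing by the mean rate are assumptions made for illustration; the percentage step in particular is not a transcription of equation (3).

```python
def barometer_coefficient(counts, hours, pressures):
    """Return (B, percent) following equations (1) and (2).

    B is in counts per hour per unit of pressure; percent expresses B
    relative to the mean rate sum(counts)/sum(hours) (an assumption here).
    """
    total_t = sum(hours)
    mean_rate = sum(counts) / total_t
    mean_b = sum(b * t for b, t in zip(pressures, hours)) / total_t
    # Equation (1): deviations of counts and pressures from their averages
    gammas = [c - t * mean_rate for c, t in zip(counts, hours)]
    betas = [b - mean_b for b in pressures]
    # Equation (2): barometer coefficient
    numerator = sum(g * be for g, be in zip(gammas, betas))
    denominator = sum(be * be * t for be, t in zip(betas, hours))
    b_coeff = numerator / denominator
    return b_coeff, 100.0 * b_coeff / mean_rate


# Hypothetical readings: the rate falls by 0.04 counts/hr per cm. Hg around 0.35
hours = [20.0, 35.0, 15.0, 30.0]
pressures = [75.2, 76.4, 74.8, 76.9]  # cm. Hg
mean_p = sum(p * t for p, t in zip(pressures, hours)) / sum(hours)
counts = [t * (0.35 - 0.04 * (p - mean_p)) for p, t in zip(pressures, hours)]

b_coeff, percent = barometer_coefficient(counts, hours, pressures)
print(f"B = {b_coeff:.3f} counts/hr per cm. Hg ({percent:.1f} % per cm. Hg)")
```

Because the sample counts are built from an exact linear law, the function recovers the slope that was put in; real readings would scatter about such a line.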



3. The correlation coefficient 


The question arises as to how far the barometer coefficient determined by (2) and 
(3) can be relied upon to express a real connexion between intensity and barometric 
pressure. 

In this connexion the correlation coefficient, $r$, defined by equation (4) is often
introduced,

$$r = \frac{\sum_k \gamma_k \beta_k}{\sqrt{\sum_k (\gamma_k^2/t_k)\,\sum_k \beta_k^2 t_k}}. \qquad (4)$$

For close correlation r has a value not very different from ±1. It should be 
pointed out, however, that if we obtain a small value of r it does not necessarily 
mean that the value of the barometer coefficient is uncertain. Indeed, the value 
of r depends on the way in which the measurements are carried out. 

Consider, for instance, a large number of short readings of the intensity. Because 
of statistical fluctuation individual readings will not follow closely the variation in 
the pressure. The correlation coefficient r will therefore be small. 

It is clear that for a group consisting of a large number of readings individual 
fluctuations will not affect the average value, and therefore the barometer coefficient 
obtained from a large number of single readings will be reliable in spite of the 
large uncertainties in the single readings and the small value of r. 

Similarly, one may consider variations which depend not only on the barometric 
pressure. In this case individual readings will not follow the barometer closely. 
Such fluctuations will, however, not influence the final result because they will 
cancel out on averaging over all readings, and the barometer coefficient obtained 
will have a real physical meaning. The correlation coefficient will be small in this case. 
The significance of this small value is that the intensity does not depend on the 
pressure only. 

We see thus that the smallness of the value of r does not decide whether or not 
the corresponding correlation is significant. 


4. Standard error of the barometer coefficient 

The correct way to decide whether any experimental result is relevant or not is 
to calculate the standard error of the result from the error (or fluctuation) of the 
individual measurements which contribute to it. 

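The barometer coefficient can be regarded as a function of the measured quantities
$\gamma_i$ and $\beta_i$. We denote the standard errors of these quantities by $\delta\gamma_i$ and $\delta\beta_i$ and assume

$$\delta\gamma_1/\sqrt{t_1} = \delta\gamma_2/\sqrt{t_2} = \dots = \delta\gamma, \qquad \delta\beta_1\sqrt{t_1} = \delta\beta_2\sqrt{t_2} = \dots = \delta\beta. \qquad (5)$$

The standard errors of the average counting rate and the average pressure are of
smaller order than those of the deviations $\gamma_i$ and $\beta_i$ and therefore will be neglected.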




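The standard error of the barometer coefficient can be obtained from (2) and (5)
in the usual way. We find

$$\delta B/B = \sqrt{\sum_k (\gamma_k^2/t_k)\,(\delta\beta)^2 + \sum_k \beta_k^2 t_k\,(\delta\gamma)^2}\Big/\sum_k \gamma_k\beta_k. \qquad (6)$$

If the error of the barometer readings is comparatively small we may put

$$\delta B/B = \delta\gamma\,\sqrt{\sum_k \beta_k^2 t_k}\Big/\sum_k \gamma_k\beta_k. \qquad (6a)$$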


5. Amplitude of the non-barometric fluctuations 


Besides establishing the correlation between barometric pressure and intensity 
it is also of interest to find out if the fluctuation of the corrected values is purely due 
to statistical fluctuation. If, for example, this fluctuation exceeds the statistical 
fluctuation one may assume that factors other than the barometric pressure influence 
the intensity. 

We introduce the quantities

$$\eta_i = \gamma_i - B\,t_i\,\beta_i, \qquad (7)$$

representing the deviations of individual readings from the average after having
been reduced to average pressure. Regarding the $\eta_i$ as functions of the $\gamma_i$ one finds

$$\sum_k \eta_k^2/t_k = (\delta\gamma)^2 (n-1), \qquad (8)$$

and with help of equations (2), (5), (6a) and (8)

$$(\delta\gamma)^2 = (1 - r^2) \sum_k (\gamma_k^2/t_k) \Big/ (n-1). \qquad (9)$$

The value of $(\delta\gamma)^2$ is the mean square deviation of the counting rates reduced to
a standard pressure. The mean square deviation consists of two parts. One contribution
is due to the statistical fluctuation and is given by $(\delta\gamma_{st})^2 = \sum_k C_k/\sum_k t_k$, and
the second contribution is due to systematic changes of the intensity not due to
barometric variation. We write for this second part $(\delta\gamma_{nst})^2 = \sigma^2$. The total fluctuation
is thus

$$(\delta\gamma)^2 = \sum_k C_k \Big/ \sum_k t_k + \sigma^2, \qquad (10)$$

and with help of (2), (7) and (9) we obtain from (10)

$$\sigma^2 = (1 - r^2) \sum_k (\gamma_k^2/t_k) \Big/ (n-1) - \sum_k C_k \Big/ \sum_k t_k. \qquad (11)$$

The above equation can be used to find the mean square amplitude $\sigma^2$ of the non-barometric
variation. If $\sigma^2$ is small there is no indication of variations other than
those caused by changing pressure.
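
The separation of the scatter into a statistical and a non-barometric part can be sketched numerically. The example below is an illustration under stated assumptions: it computes the residuals after reduction to average pressure, forms their mean square deviation as the sum of the squared residuals over the durations divided by (n − 1), and subtracts the mean counting rate as the statistical part. The function name `non_barometric_variance` and the readings are hypothetical.

```python
def non_barometric_variance(counts, hours, pressures):
    """Return (dgamma2, sigma2): mean square deviation and its non-statistical part."""
    n = len(counts)
    total_t = sum(hours)
    mean_rate = sum(counts) / total_t
    mean_b = sum(b * t for b, t in zip(pressures, hours)) / total_t
    gammas = [c - t * mean_rate for c, t in zip(counts, hours)]
    betas = [b - mean_b for b in pressures]
    b_coeff = (sum(g * be for g, be in zip(gammas, betas))
               / sum(be * be * t for be, t in zip(betas, hours)))
    # Residuals of the readings after reduction to average pressure
    etas = [g - b_coeff * t * be for g, t, be in zip(gammas, hours, betas)]
    # Mean square deviation of the reduced readings
    dgamma2 = sum(e * e / t for e, t in zip(etas, hours)) / (n - 1)
    # Subtract the purely statistical part, sum(counts)/sum(hours)
    sigma2 = dgamma2 - mean_rate
    return dgamma2, sigma2


# Hypothetical readings (counts, hours, pressure in cm. Hg)
counts = [8, 11, 6, 9, 12]
hours = [20.0, 35.0, 15.0, 30.0, 28.0]
pressures = [75.2, 76.4, 74.8, 76.9, 75.6]
dgamma2, sigma2 = non_barometric_variance(counts, hours, pressures)
print(f"mean square deviation = {dgamma2:.3f}, non-barometric part = {sigma2:.3f}")
```

A value of the non-barometric part that is zero or slightly negative would mean that the reduced readings scatter no more than counting statistics alone would produce.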


6. Numerical values 

The values of the barometric coefficients, B, obtained for the various arrangements 
are collected in table 1 . The individual values have relatively large standard errors 
and we do not attach any importance to them. 




The weighted mean of the fourteen individual results exceeds, however, four
times its standard error, and therefore the existence of a barometer effect is ascertained.

The barometer coefficient thus obtained is much larger than the barometer
coefficient observed for the total cosmic ray flux, which is of the order of 3 % per
cm. Hg.

The difference is probably even greater than appears from the figures, as approximately
30 % of the penetrating showers recorded by the arrangement […] were
knock-on showers. As the knock-on showers have the ordinary barometer effect 
the observed barometer effect of the mixture will be smaller than the effect of the 
penetrating showers alone. No correction for the admixture of knock-on showers 
has been made. 

We conclude therefore that the barometer effect of penetrating showers must be 
assumed to be larger than that of the ordinary cosmic ray beam and is probably of 
the order of 10 % per cm. Hg. 

No measurements of the barometer coefficient of penetrating showers have been 
carried out previously. Auger and Daudin ( 1942 ) and Cosyns ( 1940 ) have, however, 
determined the barometer coefficients of extensive air showers. The values found 
by these observers are between 10 and 20 % and are thus of the same order as the 
value found for the penetrating showers. 

An attempt to interpret these effects is made in the following paper.

The value of $\sigma^2$ of the non-barometric variation was determined from equation
(11) and averaged over the various arrangements. The value obtained is slightly


Table 1

                        absorber (cm. Pb)
number    reference      Σ        T        time (hr.)    count       B         (δB/B)

   1         I           0        0           688          368      −5.3      [illegible]
   2         I           0        3           102           92     −10.0      3.2
   3         I           0        5           163          122     −15.1      1.0
   4         I           0       10           381          311     −10.4      1.0
   5         I           5       10          1290         1100     −12.9      0.43
   6         I          35       10          1362          937      −4.3      1.4
   7         I           0        3           206          145      +3.6      3.2
   8         II          0        3           138           55     +15.9      2.0
   9         II          0        0           965          198     −10.0      0.9
  10         II          0        7.5*        626          208      −2.5      2.9
  11         II          0        2           910          309     −18.6      0.42
  12         II          0        1           309           68     −21.8      0.83
  13         II          0        5           586          229     −10.5      0.80
  14         II          0       10           495          230     −14.0      1.0

weighted average B = −(11.7 ± 2.7) % per cm. Hg

I, Janossy & Rochester (1943); II, Janossy & Rochester (1944).

* cm. Al.



negative and therefore the fluctuation of the readings reduced to normal pressure 
is only statistical. There is no indication of a systematic variation due to causes 
other than variation of pressure. 

Since the statistical fluctuation is large it cannot be excluded that systematic 
variations with amplitudes much smaller than the barometric fluctuation occur but 
are masked by the statistical fluctuation. Variations with an amplitude comparable 
to the fluctuations due to pressure changes would have been detected.

We are indebted to Dr T. G. Cowling of Manchester University and to Dr
O. R. Frisch of Liverpool University for useful comment.


References 

Auger & Daudin 1942 Phys. Rev. 61, 91.
Cosyns 1940 Nature, Lond., 145, 668.
Janossy & Rochester 1943 Proc. Roy. Soc. A, 182, 180.
Janossy & Rochester 1944 Proc. Roy. Soc. A, 183, 181.


Some aspects of the production of mesons and the barometer 
effect of penetrating showers 

By L. Janossy, The Physical Laboratories, The University, Manchester

(Communicated by P. M. S. Blackett, F.R.S.—Received 2 March 1944)

It is shown that the large barometer effect of penetrating showers can be understood in terms 
of an exponential absorption of the primaries giving rise to the showers. A number of other 
experimental results favour such an assumption. It is shown further that the exponential 
absorption of the primaries is compatible with the theory of Hamilton, Heitler & Peng when 
account is taken of fluctuations. 


1. Introduction 

From a large number of observations of penetrating showers Janossy & Rochester 
(1944) have found the barometer coefficient of penetrating showers to be of the order 
of 10 % per cm. Hg. This barometer effect corresponds to an absorption of about 
1 % per g. of the atmosphere. Janossy & Rochester (1943a) have found, however, 
that the absorption of penetrating showers is only 0.1 % per g. of lead. Thus the
absorption in air as deduced from the barometer effect appears to be ten times larger 
than that in lead. 

As penetrating showers probably consist mainly of mesons, one would expect an 
absorption according to mass, and it would be very difficult to understand a much 
stronger absorption in air than in lead. The instability of the meson could hardly 




account for this behaviour in terms of decaying mesons, for the decay of mesons 
would give rise to a temperature rather than to a barometer effect.

The difficulty can be overcome by assuming that the barometer effect is 
not so much due to the absorption of penetrating showers in air but to the 
absorption of the primaries producing them. The large barometer effect and the 
small absorption of penetrating showers in lead can be understood in the following 
way. 

Consider primaries which have just sufficient energy to penetrate the atmosphere
at the normal pressure $P$. A fraction $\delta N$ of the primaries will be absorbed in $T$
(figure 1) giving rise to penetrating showers and thus will be recorded. If an extra
absorber $T_1$ is placed on top of $T$, the fraction $\delta N$ will give rise to penetrating showers
in $T_1$ and not in $T$, but as the showers are only slightly absorbed in the absorber $T$
they are almost as likely to be recorded when produced in $T_1$ as when produced in $T$;
thus the absorber $T_1$ will have little effect on the observed intensity.



Figure 1. Showers produced in equivalent layers of lead and air.

Compare the effect of the extra absorber $T_1$ with the effect of an equivalent increase
$P_1$ of the barometric pressure. $P_1$ should be so chosen that the extra air above the
arrangement due to the pressure $P_1$ should have the same mass per unit area as the
absorber $T_1$.

The effect of the increase of air pressure on the coincidence rate is very different
from the effect of the absorber $T_1$. The fraction $\delta N$ which could just reach the top
of the apparatus when the air pressure was $P$ will, when the pressure increases to
$P + P_1$, be stopped in that layer above the surface at which, after the change, the
pressure has now become $P$. Because of the low density of air this layer will have a
height of the order of 100 m. for average changes of $P$. Showers produced at these
heights will be very diffuse when they reach the apparatus and will therefore usually
not be recorded (figure 1).



It follows that the dense absorber $T_1$ placed close above the apparatus produces
a decrease according to the absorption of the showers in $T_1$, while an extra mass of
air above the apparatus produces a decrease according to the absorption of the
primaries in the air.

The large barometer coefficient seems therefore to indicate that the primaries
giving rise to penetrating showers are strongly absorbed in air. The absorption
coefficient of the primaries which is required to account for the observed barometer
effect is of the order

$$1/\mu_p = 100 \text{ g. per cm.}^2. \qquad (1)$$
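
The order of magnitude of this absorption length follows from converting the barometer coefficient into a fractional change per gram of air. The sketch below assumes a round mercury density of 13.6 g. per cm.^3, so that 1 cm. Hg corresponds to 13.6 g. per cm.^2 of overlying air; the function name `absorption_length` is an illustrative choice.

```python
MERCURY_DENSITY = 13.6  # g. per cm.^3; assumed round value


def absorption_length(percent_per_cm_hg):
    """Mass thickness (g. per cm.^2) over which the intensity falls by a factor e."""
    fraction_per_gram = (percent_per_cm_hg / 100.0) / MERCURY_DENSITY
    return 1.0 / fraction_per_gram


for coeff in (10.0, 11.7):
    length = absorption_length(coeff)
    print(f"{coeff} % per cm. Hg gives an absorption length of about {length:.0f} g. per cm.^2")
```

Both values fall between 100 and 150 g. per cm.^2, consistent with the order of magnitude given in the text.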


2. Absorption of the primaries (experimental)

There is some experimental evidence to show that the primaries giving rise to
penetrating showers are absorbed with an absorption coefficient $\mu_p$ of the order
given in (1).

(a) The transition effect of penetrating showers shows saturation at approximately
5 cm. of lead. This saturation can be accounted for either by assuming that
a large fraction of the incident primaries are absorbed in 5 cm. of lead or by assuming
that the primaries are producing a number of penetrating secondaries in 5 cm. of
lead sufficient to discharge the shower recording arrangement.

The second alternative was put forward by Hamilton, Heitler & Peng (1943) 
[quoted in the following as H.H.P.] and by myself (1943), but the barometer effect 
seems to favour the first interpretation. It will be shown in the following that the 
two points of view can be reconciled. 

(b) It was shown by Janossy & Rochester (1943) that part of the penetrating
showers is produced by neutral particles. Further, Rossi & Regener (1940) and
Janossy & Rochester (1943b) have observed a penetrating non-ionizing radiation
with a mean free path of about 100 g. per cm.$^2$. It is very probable that both of these
neutral radiations consist of neutrons. The more energetic neutrons may be able to
give rise to penetrating showers, while the bulk of the neutrons, having too little
energy for producing penetrating showers, may give rise only to ionizing secondaries.

The intensity found by Rossi & Regener at Mt Evans (4500 m. above sea-level) 
was estimated by us to be 30-60 times larger than the intensity of neutral particles 
in Manchester. The ratio of the intensities suggests an absorption coefficient of the 
order of (1). 


3. Absorption of primaries (theoretical) 

(a) Two main types of absorption phenomena have to be considered: 

(i) Catastrophic absorption . The primary loses all its energy in its first collision. 
The absorption law in this case is essentially exponential and does not depend on the 
incident spectrum, provided the collision cross-section does not vary much with 
energy. 




(ii) Range absorption. The primaries lose energy gradually. The more energetic 
primaries penetrate deeper and the absorption law is a function of the energy 
spectrum. 

Intermediate processes may also occur. 

Consider the case in which the primaries lose a constant amount jS of energy per 
gram of absorber and assume ft to be independent of energy. Assume an incident 
spectrum of the type 

S(E) = (EJE)y (E>E q ). ( 2 ) 

The incident spectrum of the primaries is likely to be of this type, since it is known 
that the meson spectrum is between wide limits of this type, and it is not likely that 
the spectrum of the primaries producing the mesons is very different from the 
secondary meson spectrum. 

The energy spectrum at the depth $x$ below the top of the atmosphere will then be

$$S(E, x) = E_0^{\gamma}/(E + \beta x)^{\gamma} \qquad (E > E_0), \tag{3}$$

and, neglecting particles of energy less than $E_0$, the absorption function is given by

$$I(x) = E_0^{\gamma}/(E_0 + \beta x)^{\gamma}. \tag{4a}$$

(b) The barometer coefficient near sea-level, i.e. at $x = H$, is given by

$$B = (H/I)\,(dI/dx)_{x=H} \times 1.33\ \%\ \text{per cm. Hg}; \tag{5}$$

if $\beta H \gg E_0$ this reduces to

$$B = -1.33\,\gamma\ \%\ \text{per cm. Hg (near sea-level)}. \tag{5a}$$


As $\gamma$ is of the order of 2 the barometer coefficient is of the order of 3 % per cm. Hg.
Such a value is observed for the bulk of cosmic rays.

Note that the coefficient $B$ in (5a) is independent of the specific loss $\beta$. Therefore
the barometer coefficient for the primaries producing the mesons should also be
of the order of 3 % if they lose energy gradually.

(c) The barometric pressure at Mt Evans is 0.62 atm. Therefore the ratio of the
intensities at Mt Evans and at sea-level is, from (4a) with $\gamma = 2$ and $E_0 \ll \beta H$,

$$I_{\text{Mt Evans}}/I_{\text{sea-level}} = 0.62^{-2} = 2.6. \tag{6}$$

This value is far too small to be compatible with the estimated ratio of 30 to 60.

Thus, assuming continuous absorption of the primaries, one fails to account 
either for the ratio of neutron intensities on Mt Evans and at sea-level or for the 
large barometer coefficient found at sea-level. 

Both effects would be accounted for immediately by assuming catastrophic loss.
Replacing equation (4a) by

$$I(x) = \exp[-10x/H], \tag{4b}$$

the modified equations (5) and (6) give numerical values in good agreement with
the experiments.
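
The comparison between range absorption (4a) and catastrophic absorption (4b) can be checked numerically. The following is only an illustrative sketch: the function names are hypothetical, the power law is taken in the limit $\beta H \gg E_0$, and the inputs ($\gamma = 2$, a depth ratio of 0.62, the factor 1.33 % per cm. Hg) are the values quoted in the text.

```python
import math

# Values quoted in the text; the function names are illustrative only.
GAMMA = 2.0
DEPTH_RATIO = 0.62          # depth at Mt Evans divided by depth at sea-level
PERCENT_PER_CM_HG = 1.33    # conversion factor used in equation (5)

def power_law_ratio(gamma=GAMMA, depth_ratio=DEPTH_RATIO):
    """Intensity ratio Mt Evans / sea-level from (4a), limit beta*H >> E0."""
    return depth_ratio ** (-gamma)

def exponential_ratio(k=10.0, depth_ratio=DEPTH_RATIO):
    """Intensity ratio from (4b), I(x) = exp(-k*x/H)."""
    return math.exp(k * (1.0 - depth_ratio))

def barometer_power_law(gamma=GAMMA):
    """Equation (5a), in % per cm. Hg."""
    return -PERCENT_PER_CM_HG * gamma

def barometer_exponential(k=10.0):
    """Equation (5) applied to (4b): (H/I) dI/dx = -k."""
    return -PERCENT_PER_CM_HG * k

if __name__ == "__main__":
    print(power_law_ratio(), exponential_ratio())
    print(barometer_power_law(), barometer_exponential())
```

Under these assumptions the exponential law gives a ratio inside the estimated interval of 30 to 60, whereas the power law gives only about 2.6.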



194 L. Janossy 

Using (4b) instead of (4a) it also becomes somewhat easier to account for the
transition effect of penetrating showers.

It seems, therefore, that the experimental facts are better accounted for in terms
of catastrophic absorption than in terms of range absorption.


4. Comparison with the theory of Hamilton, Heitler and Peng

In this theory it is assumed that the incident primaries are protons and possibly 
neutrons. It is shown that these fast heavy particles lose energy in inelastic collisions 
with protons and neutrons resulting in the emission of mesons. The rate of energy 
loss for primaries of energy E is given by 

$$-\frac{dE}{dx} = 0.9 \log_e (0.3E). \tag{7}$$

(All energies are given in units of $\mu c^2$, $\mu$ being the rest mass of the meson.) At first sight one
should expect absorption according to range as a consequence of equation (7). In 
the following it will be shown, however, that owing to the large fluctuations of energy 
loss connected with the mechanism proposed by H.H.P. the absorption function is 
of the range type only for very large thicknesses, but for smaller thicknesses the 
absorption curve is more nearly represented by an exponential function. 

Though the number of single encounters of a primary on its way through the 
atmosphere is large, these encounters are not distributed at random for, as I have 
shown, a primary particle passing through a single nucleus is likely to collide with 
several nuclear particles (Janossy 1943). As a result of this crowding of the single
collisions a primary is likely to lose large amounts of energy in a few collisions. 

The energy required for a primary to penetrate the atmosphere is found with the 
help of equation (7) to be of the order of 7000, and it is estimated that this energy is 
lost in about 10 energetic collisions. 

Primaries with energies below 7000/10 = 700 are therefore expected to be stopped
in the first collision. A primary with energy much less than 700 is thus only likely to
reach sea-level if it happens not to collide with any nucleus on its way. The probability
of a primary not colliding once along $H$ when its mean free path is $H/10$ is

$$e^{-10} \approx 1/20{,}000.$$

Thus one out of 20,000 low-energy primaries will reach sea-level. The fraction of the
primaries having sufficient energy to penetrate the atmosphere without fluctuation
is $(E_0/7000)^{\gamma}$, where $E_0$ is the smallest incident energy sufficient for producing
penetrating showers at sea-level. Assuming $\gamma = 2$ and $E_0 = 50$ this fraction is also 1/20,000.
Thus the number of low-energy primaries having reached sea-level by fluctuation
is of the same order as the number of high-energy primaries reaching sea-level after
having suffered average loss.
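
The two fractions compared above are simple to evaluate. The sketch below uses hypothetical function names and only the numbers given in the text (ten mean free paths in the atmosphere, $E_0 = 50$, $\gamma = 2$, a penetration energy of 7000).

```python
import math

def survival_without_collision(n_mean_free_paths=10.0):
    """Chance of crossing the atmosphere without a single collision."""
    return math.exp(-n_mean_free_paths)

def fraction_above(e_min, e0=50.0, gamma=2.0):
    """Fraction of a power-law spectrum (E0/E)**gamma lying above e_min."""
    return (e0 / e_min) ** gamma

p_fluctuation = survival_without_collision()   # low-energy primaries
p_range = fraction_above(7000.0)               # high-energy primaries

if __name__ == "__main__":
    print(1.0 / p_fluctuation, 1.0 / p_range)
```

Both reciprocals come out near 20,000, which is the sense in which the two components are of the same order at sea-level.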

Roughly speaking the primaries behave as if they consisted of two independent
components, one component being absorbed exponentially, the other having a
range absorption curve. Down to a certain depth the exponential component is
preponderant, while below this depth the fluctuations become unimportant and the
absorption curve corresponds to the average loss.

Assuming that the exponential part of the absorption curve persists to a depth 
somewhat below sea-level, all experiments described in the last section can be 
accounted for in terms of the theory of H.H.P. 

As the absorption with fluctuation is a rather complex process, it seems worth 
while to confirm these qualitative considerations by a more detailed calculation. 


5. Mathematical treatment of the fluctuation of energy loss 


Diffusion equation 

In the following the total energy loss will be subdivided into two parts: (1) large
losses: write $L(y)\,dx$ for the probability of an energy loss exceeding $y$ in a single
nuclear collision along the path $dx$; (2) small losses: it will be assumed that the loss along
a path $dx$ is equal to $\beta\,dx$, $\beta$ being independent of the energy. The diffusion equation
giving the change of the differential energy spectrum $D(E, x)$ can be written as

$$\frac{\partial D(E,x)}{\partial x} = -L(0)\,D(E,x) + \beta\,\frac{\partial D(E,x)}{\partial E} - \int_0^{\infty} D(E+y,x)\,\frac{dL(y)}{dy}\,dy. \tag{8}$$

Integrating by parts, equation (8) can be transformed into

$$\frac{\partial D(E,x)}{\partial x} = \beta\,\frac{\partial D(E,x)}{\partial E} + \int_0^{\infty} \frac{\partial D(E+y,x)}{\partial E}\,L(y)\,dy; \tag{9}$$

a particular solution of this equation is

$$D_{\epsilon}(E,x) = \exp[-\epsilon E - A(\epsilon)x] \tag{10}$$

with

$$A(\epsilon) = \beta\epsilon + \epsilon\int_0^{\infty} L(y)\,e^{-\epsilon y}\,dy. \tag{11}$$


The general solution of equation (8) is therefore

$$D(E,x) = \int_0^{\infty} a(\epsilon)\exp[-\epsilon E - A(\epsilon)x]\,d\epsilon, \tag{12}$$

where $a(\epsilon)$ has to be chosen so that the solution obtained satisfies a boundary
condition, that is, $a(\epsilon)$ has to satisfy equation (13),

$$D(E,0) = \int_0^{\infty} a(\epsilon)\exp(-\epsilon E)\,d\epsilon. \tag{13}$$

The general expression for $a(\epsilon)$ can be obtained from equation (13) by means of
the Mellin transform. We are, however, only interested in the special case where the
incident spectrum obeys a power law. In this case we have

$$D(E,0) = \gamma E_0^{\gamma}/E^{\gamma+1} \quad\text{and}\quad a(\epsilon) = (\epsilon E_0)^{\gamma}/(\gamma-1)!. \tag{14}$$



The solution of equation (8) satisfying the boundary condition (14) is therefore

$$D(E,x) = \frac{1}{(\gamma-1)!}\int_0^{\infty} (\epsilon E_0)^{\gamma}\exp[-\epsilon E - A(\epsilon)x]\,d\epsilon. \tag{15}$$

Integrating equation (12) with respect to $E$ from $E_0$ to infinity one obtains the
integral spectrum. It is found that

$$S(E_0,x) = \frac{1}{(\gamma-1)!}\int_0^{\infty} (\epsilon E_0)^{\gamma}\exp[-\epsilon E_0 - A(\epsilon)x]\,\frac{d\epsilon}{\epsilon}. \tag{16}$$

The values of $S(E_0, x)$ as obtained from equation (16) depend on the choice of the
function $L(y)$. Before evaluating equation (16) for the case of energy loss due to
meson production two extreme cases may be considered.

(a) $L(y)$ = const. This represents a case in which the primaries are
stopped in the first collision independently of their energy. Since $L(y)$ then equals
$L(0)$ for all $y$, it is found with the help of equation (11) that

$$A(\epsilon) = \beta\epsilon + L(0); \tag{11a}$$

introducing this expression into equation (16) one finds

$$S(E_0,x) = \frac{E_0^{\gamma}}{(E_0+\beta x)^{\gamma}}\exp[-L(0)\,x]. \tag{16a}$$

Thus the absorption is essentially exponential. The barometer coefficient as
defined in equation (5) will be found to be

$$B = -1.33\,(\gamma + L(0)H)\ \%\ \text{per cm. Hg} \qquad (\beta H \gg E_0). \tag{5b}$$

It can be inferred from this equation that the observed barometer effect can be
accounted for by assuming sufficiently large but infrequent losses for the primaries.
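
The step from the integral (16) to the closed form for a constant $L(y)$ can be checked by direct numerical integration. The sketch below is an illustration under stated assumptions, not part of the original treatment: it assumes the integral spectrum carries the factor $1/\epsilon$ that arises from integrating $\exp(-\epsilon E)$ over $E$, it takes an integer $\gamma$, and the value chosen for the constant `big_l` is hypothetical.

```python
import math

def integral_spectrum(e0, x, beta, big_l, gamma=2, n=20000):
    """Simpson estimate of the integral spectrum for constant L(y) = big_l.

    With constant L, A(eps) = beta*eps + big_l. The integrand is
    (eps*e0)**gamma / eps * exp(-eps*e0 - A(eps)*x). gamma must be an integer.
    """
    a = e0 + beta * x            # decay scale of the integrand in eps
    upper = 60.0 / a             # the integrand is negligible beyond this
    h = upper / n                # n must be even for Simpson's rule

    def integrand(eps):
        if eps == 0.0:
            return 0.0           # eps**(gamma - 1) vanishes for gamma > 1
        return (eps * e0) ** gamma / eps * math.exp(-eps * a - big_l * x)

    total = integrand(0.0) + integrand(upper)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * integrand(i * h)
    return total * h / 3.0 / math.factorial(gamma - 1)

def closed_form(e0, x, beta, big_l, gamma=2):
    """Power law in (e0 + beta*x) times exp(-big_l*x)."""
    return (e0 / (e0 + beta * x)) ** gamma * math.exp(-big_l * x)

if __name__ == "__main__":
    # e0 = 50 and beta = 0.02 are from the text; big_l = 0.01 is assumed.
    print(integral_spectrum(50.0, 1000.0, 0.02, 0.01))
    print(closed_form(50.0, 1000.0, 0.02, 0.01))
```

The two printed values should agree closely, which illustrates that a constant $L(y)$ simply multiplies the range-type power law by an exponential factor.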

(b) $L(y)$ is very large for small values of $y$ and is zero for larger values, so that the
average rate of loss due to $L(y)$ is $\beta_1$. In this case one finds

$$A(\epsilon) = (\beta+\beta_1)\,\epsilon \tag{11b}$$

and

$$S(E_0,x) = \frac{E_0^{\gamma}}{[E_0 + (\beta+\beta_1)x]^{\gamma}}; \tag{16b}$$

thus the absorption is essentially a range absorption; this case is incompatible with
experiment.

(c) To treat the case of energy loss according to the theory of H.H.P. assume

$$L(y) = \begin{cases} 0.9/(y-B) & (y > y_1), \\ 10.7\times10^{-3} & (y < y_1), \end{cases} \tag{17}$$

with $y_1 = 320$ and $B = 240$. A justification of this expression will be given in the
Appendix. Further, assume $\beta = 0.02$, this representing the ionization loss suffered
by a proton.

Introducing equation (17) into equation (11),

$$A(\epsilon) = \beta\epsilon - 0.9\,\epsilon\,e^{-B\epsilon}\,\mathrm{Ei}(-(y_1-B)\epsilon), \tag{11c}$$

where $\mathrm{Ei}(-x)$ is the exponential integral. No analytic expression for equation (16) can be
obtained in this case, but the integral can be evaluated by the saddle-point method.

The integrand in equation (16) vanishes both for small and large values of $\epsilon$ and
has therefore maxima in between.

Proceeding in the usual way, write for the integrand on the right-hand side of (16)

$$\text{integrand} = \exp[-f(\epsilon)], \tag{18}$$

$$f(\epsilon) = \epsilon E_0 + A(\epsilon)x - \ln(\epsilon E_0); \tag{19}$$

saddle points are defined by

$$f'(\epsilon) = E_0 + A'(\epsilon)x - \frac{1}{\epsilon} = 0 \quad\text{for}\quad \epsilon = \epsilon_0. \tag{20}$$

It is convenient to represent all quantities as functions of the independent parameter
$\epsilon_0$. Thus

$$x(\epsilon_0) = \frac{1/\epsilon_0 - E_0}{A'(\epsilon_0)}; \tag{21}$$

$x$ is plotted as a function of $\epsilon_0$ in figure 2. This graph is rather remarkable. It is seen
that in the region

$$320 < x < 1600, \tag{22}$$

$f(\epsilon)$ has three extrema, i.e. two minima and a maximum. The value $S$ of the integral
(16) for values of $x$ in the interval given by equation (22) can be regarded as the sum
$S_1 + S_2$ of the contributions of the two saddle points.



Figure 2. Position of saddle points. 




Consider first the branch corresponding to the larger values of $x$ and the small
values of $\epsilon_0$. From equation (11c) the following asymptotic expression valid for
small values of $\epsilon$ can be derived:

$$A'(\epsilon) \approx A(\epsilon)/\epsilon \approx -0.9\ln(140\epsilon). \tag{23}$$

Introducing this into equation (18) and carrying out the integration in the neighbourhood
of the first saddle point, the contribution of this saddle point is

S^x) = 1/nel \ 

i (24) 

x = l/( — 0-9e o ln(140e 0 )).J 

Comparing equations (24) and (7) it is noted that $x$ is roughly equal to the range of
the primaries with energy $1/\epsilon_0$, while $S_1$ is roughly the fraction of incident particles
with energies exceeding $1/\epsilon_0$. Thus the branch of the curve (figure 2) corresponding
to the small values of $\epsilon_0$ represents the fictitious component showing range absorption.
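
The small-$\epsilon$ form of $A(\epsilon)/\epsilon$ can be compared with the full expression obtained from the assumed $L(y)$. The sketch below is illustrative only: it evaluates the exponential integral from its standard power series (adequate for small arguments), uses $\mathrm{Ei}(-z) = -E_1(z)$, and takes $\beta = 0.02$, $B = 240$ and $y_1 = 320$ from the text. The function names are hypothetical.

```python
import math

EULER_GAMMA = 0.5772156649015329

def exp_integral_e1(z, terms=60):
    """E1(z) = -Ei(-z) from its power series; used here for 0 < z <= 1 only."""
    if not 0.0 < z <= 1.0:
        raise ValueError("series used here only for 0 < z <= 1")
    total = -EULER_GAMMA - math.log(z)
    term = 1.0
    for k in range(1, terms + 1):
        term *= -z / k            # term is now (-z)**k / k!
        total -= term / k         # subtract (-z)**k / (k * k!)
    return total

def a_over_eps(eps, beta=0.02, big_b=240.0, y1=320.0):
    """A(eps)/eps for the H.H.P. loss function, written with E1."""
    return beta + 0.9 * math.exp(-big_b * eps) * exp_integral_e1((y1 - big_b) * eps)

def asymptote(eps):
    """The small-eps approximation -0.9 ln(140 eps)."""
    return -0.9 * math.log(140.0 * eps)

if __name__ == "__main__":
    for eps in (1e-5, 1e-4):
        print(eps, a_over_eps(eps), asymptote(eps))
```

The factor 140 is close to $80\,e^{0.577}$, that is $(y_1 - B)$ multiplied by the exponential of Euler's constant, which is where the logarithmic asymptote comes from.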



Figure 3. Absorption of primaries in the atmosphere. 


The contribution $S_2$ due to the branch corresponding to the larger values of $\epsilon_0$ has
been evaluated numerically, and it is found to be very nearly exponential in $x$. Both
contributions $S_1$ and $S_2$ as well as $S = S_1 + S_2$ are plotted in figure 3. It is seen from
figure 3 that our expectations regarding the nature of the absorption curve in § 3
are confirmed by the quantitative treatment. The absorption is purely exponential
down to a critical depth $H_c$, while for greater depth the curve is represented by a
power function. The critical depth is, according to figure 3, about 800 g. per cm.².
This value is somewhat short of the total depth of the atmosphere. Considering,
however, the large uncertainties in the numerical values for the cross-sections
involved, one cannot attach any significance to the exact numerical value of this
critical depth.




It can thus be concluded that the exponential absorption of the primaries should 
persist down to a critical depth which is of the same order as the height of the 
atmosphere. 

6. Comparison with experiment 

(a) The ratio of the primary intensities on Mt Evans and near sea-level as calculated
from figure 3 is about thirty, in agreement with the experimental estimate.

(b) Cloud-chamber photographs at high altitudes have been obtained by Powell
(1940, 1941), Hughes (1941), Wollan (1941) and Hazen (1944). These photographs
show groups of penetrating particles. No evidence is, however, found for the
occurrence of more complicated processes such as one would expect due to energetic
heavy particles capable of passing through more than one nucleus. These experimental
findings are to be expected according to the present picture. The small
groups of mesons can be attributed to the exponential component which must be
expected to be preponderant at 3000 m. above sea-level. More complicated penetrating
showers must be comparatively rare at these heights, as most of the energetic
primaries are contained in the range component which is relatively weak there.

(c) The barometer coefficient near sea-level taken from the slope of the curve in 
figure 3 is found to be 7 % per cm. Hg. 

This value is somewhat smaller than the observed value. The disagreement is,
however, by no means serious, as the barometer coefficient changes very rapidly
with depth in the region of the critical depth. Indeed, at heights above the critical
depth where the absorption is nearly exponential the barometer coefficient as
calculated from figure 3 is 16 % per cm. Hg. Along a comparatively short distance
the value of the coefficient drops down to about 3 %. Thus a slight change in the
estimation of $H_c$ would result in a large change of the estimated value of the barometer
coefficient at sea-level.

Barometer effect of extensive air showers 

It can be shown easily that the exponential component consists mainly of low- 
energy particles, that is, of particles with energies of about 30-130. Extensive air 
showers observed by counters separated by a few metres have necessarily much 
higher energies and therefore the exponentially absorbed component cannot be 
responsible for the production of the extensive air showers. 

The proposed theory gives therefore no explanation for the large barometer
coefficients of extensive air showers observed by Auger & Daudin (1942) and Cosyns
(1940). The large barometer coefficient of extensive air showers may be due to the
absorption of the showers themselves and not to the absorption of the primaries
producing them. A different view was put forward by Auger & Daudin (1942).

Energies of penetrating showers 

To account for the barometer effect of penetrating showers near sea-level in
terms of the exponentially absorbed component it is necessary to assume that
about half of the showers at sea-level are due to the exponential component. This
half of the showers should consist of showers with total energies not exceeding 130
and having often much lower energies.

Though at present no direct evidence supporting this prediction is available, it 
may be pointed out that this is quite compatible with my original estimation of the 
energies of penetrating showers based on observation (Janossy 1942, page 369).

Latitude effect of penetrating showers 

If the present interpretation is correct, penetrating showers should show a 
marked latitude effect. All the energies giving rise to the exponential component 
are forbidden at low latitude, and only the component due to high-energy primaries 
should be observed there. If the exponential component amounts to 60 % at high 
latitudes the latitude effect should also amount to 60 %. 

Further, because of the absence of the exponential component the barometer 
effect at low latitudes should be of the order of 3 % only. 

The shape of the transition curve for penetrating showers at low latitudes might 
also prove somewhat different from that at high latitudes. 

The only penetrating shower experiments carried out at low latitudes are those
of Wathaghin, Santos & Pompeia (1940). The coincidence rates observed by Wathaghin
are of the same order as those recorded in Manchester, but as the arrangements
used are rather different, no quantitative conclusions can be drawn.

Confirmation of these predictions would support the mechanism put forward in
this paper. On the other hand, should these predictions prove wrong, one would
be forced to assume that primaries of sufficient energy to penetrate the earth’s
magnetic field at low latitudes, i.e. primaries largely exceeding 600, still showed
an exponential absorption curve. Such behaviour might be accounted for by assuming
that high-energy protons and neutrons suffer energy losses greatly exceeding those
predicted by H.H.P. At present there is, however, no need for such an assumption.


7. Appendix 

The differential cross-section for the emission of mesons in close collisions is given
by H.H.P. as

$$d\Phi = (a/E^2)\,dE, \tag{25}$$

with $a = 1.5 \times 10^{-24}$ cm.². This cross-section can be interpreted schematically by
assuming that in a collision with the closest approach $x$ a meson of energy

$$E = a/\pi x^2 \tag{26}$$

is emitted, provided $x$ is not too large and provided $E$ is less than the primary energy.

Two kinds of collisions have to be considered: (1) collisions where the fast particle
crosses the actual area of the nucleus; (2) collisions where the fast particle by-passes
the nucleus. In the second kind of collision the fast particle suffers distant collisions
with all of the nuclear constituents, and it should give rise to the emission of slow
mesons. In this case the assumption that the nuclear particles are independent is
certainly not correct and the emission of mesons cannot be estimated in a simple
way. It seems, however, reasonable to assume that in the case of the distant collision
the nucleus acts more like a single body and the collision will be elastic with little
loss of energy. We shall therefore neglect the loss of energy due to the distant
collisions.

The energy loss in collisions where the primary passes through the nucleus is
estimated roughly as follows. The radius $r_A$ of a nucleus of weight $A$ may be assumed as

$$r_A = 0.53\,(A^{1/3} - 1)\,(e^2/mc^2), \tag{27}$$

as shown elsewhere (Janossy 1943). A distance $r_0$ can be so defined that a fast
particle traversing a disk with radius $r_0 + r_A$ round the centre of the nucleus suffers
on average only one collision having a distance of approach smaller than $r_0$. Ascribing
an area $\pi r_0^2$ to each of the $A$ nuclear particles one has

$$A\pi r_0^2 = \pi(r_0 + r_A)^2, \tag{28}$$

and therefore $r_0 = r_A/3$ for $A = 16$.

Thus a fast particle traversing a nucleus will usually collide with one particle at
a distance smaller than $r_0$ and suffer $A - 1$ more distant collisions with the other
particles. Averaging over all possible points of intersection of the fast particle with
the nucleus I obtained with the help of equation (26) for $A = 16$ for this loss

$$y_1 = 240. \tag{29}$$

The energy $y_2$ which is lost in the closest collision has to be added. The probability
per nucleus that the distance of this collision lies in the interval $x$, $dx$ is $2\pi A x\,dx$
with $x < r_0$. Therefore with the help of equation (26)

$$d\Phi_2(y_2) = A a\,dy_2/y_2^2 \tag{30}$$

is obtained for the differential cross-section of a loss $y_2$. As the closest collision is
assumed to have a distance less than $r_0$ it must be assumed

$$y_2 > a/\pi r_0^2 = 80. \tag{31}$$

With the help of equations (28), (30) and (31) one finds for the integral cross-section

$$\Phi_2(y_2) = \pi(r_0 + r_A)^2\,80/y_2. \tag{32}$$

The total loss suffered is, however, $y = y_1 + y_2$, and therefore the cross-section for
the loss of energy exceeding $y$ is

$$\Phi(y) = \pi(r_0 + r_A)^2\,80/(y - 240) \qquad (y \geq 320). \tag{33}$$

Introducing numerical values into equation (33) one obtains finally,

$$L(y) = (N/A)\,\Phi(y) = 0.9/(y - 240) \qquad (y > 320). \tag{34}$$
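
The numerical constants of the Appendix can be reproduced approximately from the stated formulae. The sketch below rests on assumptions that the text does not spell out: that the length unit in the nuclear radius is the classical electron radius $e^2/mc^2 \approx 2.82 \times 10^{-13}$ cm., and that $N$ is Avogadro's number, so that $L$ is a probability per g. per cm.². The function names are hypothetical.

```python
import math

# Assumed standard constants (not stated in the text):
CLASSICAL_ELECTRON_RADIUS_CM = 2.818e-13   # e^2 / (m c^2)
AVOGADRO = 6.022e23
# Values quoted in the text:
A_CROSS = 1.5e-24      # the constant a, in cm^2 (energies in units of mu c^2)
MASS_NUMBER = 16

def nuclear_radius(a_number=MASS_NUMBER):
    """Nuclear radius 0.53 (A^(1/3) - 1) e^2/mc^2, in cm."""
    return 0.53 * (a_number ** (1.0 / 3.0) - 1.0) * CLASSICAL_ELECTRON_RADIUS_CM

def r0_over_ra(a_number=MASS_NUMBER):
    """Solve A*pi*r0^2 = pi*(r0 + rA)^2 for the ratio r0/rA."""
    return 1.0 / (math.sqrt(a_number) - 1.0)

def closest_collision_threshold(a_number=MASS_NUMBER):
    """a / (pi r0^2): the smallest loss in the closest collision."""
    r0 = r0_over_ra(a_number) * nuclear_radius(a_number)
    return A_CROSS / (math.pi * r0 ** 2)

def loss_coefficient(a_number=MASS_NUMBER):
    """(N/A) * pi*(r0 + rA)^2 * a/(pi r0^2): the numerator of L(y)."""
    r_a = nuclear_radius(a_number)
    r0 = r0_over_ra(a_number) * r_a
    area = math.pi * (r0 + r_a) ** 2
    return AVOGADRO / a_number * area * closest_collision_threshold(a_number)

if __name__ == "__main__":
    print(r0_over_ra(), closest_collision_threshold(), loss_coefficient())
```

With these assumed constants the threshold comes out close to the quoted 80 and the coefficient close to the quoted 0.9. The coefficient reduces algebraically to $Na$, so it does not depend on the nuclear radius.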




Losses smaller than 320 occur according to our picture only when a particle of energy
$E$ less than 320 hits a nucleus. Such a particle is expected to lose the whole of its
energy. Thus

$$L(y) = L(320) = 10.7 \times 10^{-3} \qquad (y < 320). \tag{35}$$

Equations (34) and (35) are valid for $y \leq E$; they have to be replaced by $L(y) = 0$ for
$y > E$.

The actual shape of L(y) is not very important for the conclusions given here. 
It is, however, important for the conclusions that the losses due to the more distant 
collisions shall be neglected. 

If one did not neglect the more distant collisions, but assumed that the energy loss 
in distant collisions was still given by the sum of the individual losses with the 
nuclear particles, one would obtain an additional term to L(y). The effect of this 
additional term would be the same as if the ionization loss of the particles was 
increased about ten times. Thus with this extra term no particle with energy less 
than 300 could reach sea-level and therefore the whole exponential component 
would be cut out.

It seems, however, not unreasonable to cut off the small losses as has been done 
above. 

I am indebted to Professor W. Heitler for commenting on this paper. 


References

Auger & Daudin 1942 Phys. Rev. 61, 91.
Cosyns 1940 Nature, Lond., 145, 668.
Hamilton, Heitler & Peng 1943 Phys. Rev. 64, 78.
Hazen 1944 Phys. Rev. 65, 67.
Hughes 1941 Phys. Rev. 60, 414.
Janossy 1942 Proc. Roy. Soc. A, 179, 361.
Janossy 1943 Phys. Rev. 64, 345.
Janossy & Rochester 1943a Proc. Roy. Soc. A, 182, 180.
Janossy & Rochester 1943b Proc. Roy. Soc. A, 181, 399.
Janossy & Rochester 1944 Proc. Roy. Soc. A, 183, 186.
Powell 1940 Phys. Rev. 58, 474.
Powell 1941 Phys. Rev. 60, 413.
Rossi & Regener 1940 Phys. Rev. 58, 837.
Wathaghin, Santos & Pompeia 1940 Phys. Rev. 57, 61.
Wollan 1941 Phys. Rev. 60, 532.



Statistical thermodynamics of mixtures with zero 
energies of mixing 

By E. A. Guggenheim, M.A., Sc.D., Imperial College of Science. 

(Communicated by Sir Ralph Fowler, F.R.S. Received 3 March 1944)

A general formula has been obtained for the number of distinct arrangements of a mixture of 
any number of different types of molecules each with its own geometric properties. From this 
are deduced the thermodynamic properties of such mixtures when the energies of mixing are 
zero. In particular, the generalization of Raoult’s law has been obtained. The technique used 
is considerably simpler than that previously applied to problems of this type. 


1. Introduction. Van’t Hoff’s (1903) formulation of the laws of extremely dilute
solutions was followed by a period of confusion, when these laws were inaccurately
extrapolated to high concentrations. A new era began when G. N. Lewis (1908)
defined a perfect solution as one which obeys Raoult’s law in the form: the
vapour pressure of the solvent is at constant temperature (and constant external
pressure) directly proportional to the molecular fraction of the solvent. Lewis
drew attention to the fact that mixtures exist, in particular benzene and ethylene
chloride, which obey Raoult’s law throughout the whole range of compositions, but
he made no attempt to state the necessary conditions for this to occur. Shortly
afterwards Washburn (1910) formulated the most important thermodynamic
relations for a solution obeying Raoult’s law throughout the whole range of
compositions. He called such a solution ‘ideal’ and suggested that each real mixture
should be compared with the ideal. He wrote (in German): ‘We can be fairly certain that
a deviation from the requirements of this theory in a given case is due to
physical or chemical causes... and that it does not represent the result
of efforts to apply a set of imperfect laws which hold for no kind of
concentrated solutions...’, and ‘In order that the laws of the ideal solution may be
applicable, the nature of the medium, or the field of force in which the molecules
find themselves in the solution, must not be very different
from that of the pure liquid itself.’ Beyond this Washburn refrained
from attempting a precise formulation of the necessary conditions for a mixture
to be ideal.

It is easily shown thermodynamically that two or more substances can form a
perfect or ideal mixture only if they mix, at constant temperature and pressure,
without any change of energy or volume. These conditions, known to be necessary,
came to be regarded as sufficient (Hildebrand 1924) for ideality, until the contrary
was pointed out (Guggenheim 1933) and the name ‘semi-ideal’ was proposed for
mixtures having zero energy of mixing and zero volume change on mixing, but not
obeying Raoult’s law.



In 1932 I gave a statistical mechanical derivation (Guggenheim 1932) of the laws
of ideal solutions requiring the assumption, among others, that the several kinds of
molecule may be treated as spheres of at least approximately the same size, so that
one could consider two configurations differing from each other only by the interchange
of two molecules of different kinds. At a discussion held in Edinburgh on
‘Liquids and Solutions’ I pointed out (1937) that there were no grounds for believing
that mixtures of molecules differing greatly in size would obey Raoult’s law. Following
this discussion Fowler & Rushbrooke (1937) applied statistical mechanics
to a mixture of two kinds of molecules A and B, the former each occupying two sites
of a lattice and the latter each occupying one. They obtained formulae valid for
mixtures dilute with respect to A and mixtures dilute with respect to B, showing
that Raoult’s law is not obeyed even when the energy of mixing is zero. The
essentially similar problem of adsorption of molecules each occupying two sites on
a surface lattice was then taken up by Chang (1939), who obtained explicit formulae
valid throughout the whole range of concentrations, by using the mathematical
technique developed by Bethe for treating order-disorder transitions in alloys.
Chang’s results were adapted to mixtures of double and single molecules by Fowler
& Guggenheim (1939), and the laws relating partial vapour pressure to composition
for such mixtures were derived. Chang’s method of approach was subsequently
applied by Miller (1942, 1943) to mixtures of two kinds of molecules each occupying
respectively three sites and one site. Miller also wrote down by analogy, but without
proof, correct formulae for a mixture of chain-like molecules each occupying r sites
and molecules each occupying a single site.

Chang’s formulae were published just in time to be quoted and used by Fowler &
Guggenheim (1939, p. 367), but owing to the war it is only recently that I have had
occasion to study Chang’s papers in detail. It has already been mentioned that Chang 
used a technique invented by Bethe for the study of order-disorder in alloys. Now this 
technique is superior to the incomparably simpler technique previously applied to 
these problems by Bragg and Williams only in that due account is taken of deviations 
from randomness due to the non-zero energy of mixing, an effect neglected by Bragg 
and Williams. It would therefore be surprising if the complicated technique of 
Bethe, invented to take account of a second-order effect of a non-zero energy of 
mixing, were really required in a problem in which the energy of mixing is assumed 
to be zero. I accordingly became suspicious that Chang’s results should be derivable 
by simpler methods. This suspicion became a conviction, when I found that most of 
Chang’s formulae could be transcribed into alternative forms exhibiting a simplicity 
and symmetry previously latent. Once convinced that a simple derivation existed, 
I had little difficulty in finding it. 

The object of this paper is accordingly to give an elementary derivation of the
thermodynamic formulae for systems with zero energies of mixing and zero volume
changes of mixing. In the first place I shall consider mixtures of two kinds of molecules
each occupying respectively r sites and one site, deriving the formulae already
obtained by Miller by analogy, but without proof. Chang’s formulae are immediately
derivable as the simplest special case. According to the new derivation, Miller’s
formulae, proposed for simple chain molecules, are equally applicable to molecules
with branched chains. In the second place I shall indicate briefly how the new
simplified treatment leads immediately to formulae for mixtures of any number of
kinds of molecules occupying any number of sites. The application of Bethe’s method
to systems of such complexity would be a formidable task and it is good to know
that it is unnecessary.

2. Statement of problem and notation. Consider a system of $N_A$ molecules A each
occupying r sites and $N_B$ molecules B each occupying one site, the total number of
sites being $N_s = rN_A + N_B$. Denote by z the number of sites which are neighbours
of any given site. Then the number of sites which are neighbours of a B molecule is
of course z, but the number of sites which are neighbours of an A molecule is not rz
(except in the trivial case r = 1), but say qz, where q < r because some of the sites
adjoining a site occupied by an element of a molecule A are occupied by the next
element of the same molecule. Provided the A molecules are simple chains or
branched chains, without any closed rings, the relation between q and r is

$$qz = rz - 2r + 2. \tag{2.1}$$

The exclusion of molecules having closed rings is an essential condition for the
ensuing argument and the formulae obtained are therefore not applicable to
molecules with closed rings. In the case of flexible open chain molecules, which
can bend back on themselves, the ensuing argument is applicable provided q is
still defined by (2.1), but the definition in words of q has to be modified as follows:
qz is the number of sites neighbours of the r sites occupied by a molecule A,
excluding those neighbours of each site occupied by the next element of the
same molecule.

It is a fundamental assumption in the present problem that the energies of mixing 
are zero; in other words, that all configurations have the same configurational energy. 
The more complicated problem, when the energies of mixing are not zero, will be 
treated in the following paper (p. 213). 

3. Essential statistics. Since all configurations are assumed to have the same energy,
it follows that at a given temperature they all have the same statistical weight. This
is equivalent to the statement that the arrangement of the molecules will be completely
random. A number of consequences follow immediately from this randomness,
in particular the following.

The frequency of occupation of a chosen site by a B molecule is $N_B/(rN_A + N_B)$,
and the frequency of its occupation by an A molecule is $rN_A/(rN_A + N_B)$.

If a site is occupied by a B molecule, then the chances that a given neighbouring
site is occupied by another B molecule or by an A molecule are as $N_B$ is to $qN_A$.

If a site is occupied by an A molecule, then the chances that a given neighbouring
site is occupied by another element of the same molecule or by another molecule
(A or B) are as $r - q$ is to $q$.




If a site is occupied by an A molecule, then the chances that a given neighbouring
site is occupied by a B molecule or by a different A molecule are as $N_B$ is to $qN_A$.

From these follow immediately the following expressions for the frequencies of
occupation of a chosen pair of sites in alternative ways:

Both by B:
$$\frac{N_B}{rN_A + N_B}\,\frac{N_B}{qN_A + N_B}; \tag{3.1}$$

First by B, second by A:
$$\frac{N_B}{rN_A + N_B}\,\frac{qN_A}{qN_A + N_B}; \tag{3.2}$$

First by A, second by B:
$$\frac{qN_A}{rN_A + N_B}\,\frac{N_B}{qN_A + N_B}; \tag{3.3}$$

Both by different A’s:
$$\frac{qN_A}{rN_A + N_B}\,\frac{qN_A}{qN_A + N_B}; \tag{3.4}$$

Both by same A:
$$\frac{(r - q)N_A}{rN_A + N_B}. \tag{3.5}$$


The correctness of these values is readily verified by checking the values of the
ratio of (3.1) to (3.2), of (3.3) to (3.4) and of (3.5) to the sum of (3.3) and (3.4); also
by checking the values of the sum of (3.1) and (3.2) and of the sum of (3.3), (3.4)
and (3.5). A thorough study of these expressions and the relations between them is
profitable, although we shall in fact require to use only (3.1).
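
The checks just described can be carried out in exact arithmetic. The sketch below is only an illustration: the function names and the sample numbers ($N_A = 3$, $N_B = 10$, $r = 4$, $z = 6$) are hypothetical, and $q$ is taken from the open-chain relation $qz = rz - 2r + 2$.

```python
from fractions import Fraction

def neighbour_factor_q(r, z):
    """q from q z = r z - 2 r + 2 (open chains, no closed rings)."""
    return Fraction(r * z - 2 * r + 2, z)

def pair_frequencies(n_a, n_b, r, z):
    """Frequencies of occupation of a chosen pair of neighbouring sites."""
    q = neighbour_factor_q(r, z)
    sites = r * n_a + n_b            # r N_A + N_B
    contacts = q * n_a + n_b         # q N_A + N_B
    return {
        "BB": Fraction(n_b, sites) * n_b / contacts,
        "BA": Fraction(n_b, sites) * q * n_a / contacts,
        "AB": q * n_a / sites * n_b / contacts,
        "AA_different": q * n_a / sites * q * n_a / contacts,
        "AA_same": (r - q) * n_a / sites,
    }

if __name__ == "__main__":
    freqs = pair_frequencies(n_a=3, n_b=10, r=4, z=6)
    for name, value in freqs.items():
        print(name, value)
    print("total", sum(freqs.values()))
```

The five frequencies add up to unity, the first two add up to the frequency of finding a B molecule on the first site, and the last three add up to the frequency of finding an A molecule there.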

Now consider a particular set of $r$ sites so interrelated that it is possible for them
to be occupied by an A molecule. If the A molecules are flexible, the set of $r$ sites
must be such that an A molecule occupying them is not bent back on to itself.
According to the assumption of randomness the frequency of occupation of this
set of sites by an A molecule must be directly proportional to $N_A$, the total
number of such molecules. This frequency can be denoted by $KN_A/N_s$, where $K$
depends on $z$, $r$, $q$ but is independent of $N_A$, $N_B$. The frequency of occupation of
the whole set of sites considered by B molecules will also be required. By an obvious
extension of (3-1), this frequency is

\[ \frac{N_B}{rN_A+N_B}\left(\frac{N_B}{qN_A+N_B}\right)^{r-1}. \tag{3-6} \]


4. Kinetic argument. According to the principle of detailed balancing, when a
system is in equilibrium any single process balances the inverse process. It is therefore
possible to derive correct equilibrium conditions by considering an arbitrary
elementary process and its inverse, even though the elementary processes considered
may be so extremely infrequent, compared with some other elementary mechanisms,
that they do not appreciably contribute to the attainment or maintenance of equilibria.
Making use of this principle one can obtain an extremely simple kinetic
derivation of the equilibrium conditions sought.





Statistical thermodynamics of mixtures 


The elementary processes chosen for consideration are:

(a) the removal (‘evaporation’) of a molecule A from a selected set of $r$ sites and
its simultaneous replacement (‘condensation’) on these sites by $r$ molecules B from
the gas phase;

(b) the inverse process of removal (‘evaporation’) of $r$ molecules B from the same
set of sites and their simultaneous replacement (‘condensation’) by a molecule A
from the gas phase.

The rate of process (a) will be directly proportional to the $r$th power of $p_B$, the
partial pressure of B in the gas phase, and to the frequency of occupation of the set
of sites by an A molecule. According to the previous section, the rate will therefore
be directly proportional to

\[ p_B^{\,r}\,\frac{rN_A}{rN_A+N_B}. \tag{4-1} \]

The rate of process (b) will be directly proportional to $p_A$, the partial pressure of
A in the gas phase, and to the frequency of occupation of the whole set of sites by
$r$ molecules B. According to (3-6), the rate will therefore be directly proportional to

\[ p_A\,\frac{N_B}{rN_A+N_B}\left(\frac{N_B}{qN_A+N_B}\right)^{r-1}. \tag{4-2} \]

At equilibrium the rates of the elementary processes (a) and (b) must be equal and
consequently

\[ \frac{p_A}{p_B^{\,r}} = C'\,\frac{rN_A}{N_B}\left(\frac{qN_A+N_B}{N_B}\right)^{r-1}, \tag{4-3} \]


where $C'$ depends on the characteristics of the molecules A and B; in particular on
$r$, $q$ and $z$; and finally on the temperature $T$; but $C'$ is independent of $N_A$ and $N_B$.

It will be convenient to make considerable use of thermodynamic functions
denoted by $\lambda$, related to Gibbs' potentials $\mu$ by $\mu = kT\log\lambda$, and conveniently
referred to by the name ‘absolute activities’ (Fowler & Guggenheim 1939, p. 66).
The relation between the partial pressure $p$ of a perfect gas and its absolute
activity $\lambda$ is

\[ \lambda = \frac{p}{kT\,\phi(T)}, \tag{4-4} \]

where $\phi(T)$ is the partition function of the gaseous molecule with the volume factor
removed. All that matters here is that the ratio $\lambda/p$ is independent of $p$. Therefore
(4-3) may be replaced by the similar relation

\[ \frac{\lambda_A}{\lambda_B^{\,r}} = C\,\frac{N_A}{N_B}\left(\frac{qN_A+N_B}{N_B}\right)^{r-1}, \tag{4-5} \]

where $C$, like $C'$, depends on the characteristics of the molecules A and B and on
the temperature, but is independent of $N_A$ and $N_B$.

If $f_A$ and $f_B$ are used to denote the partition functions of molecules A and B
respectively attached to specified sites, then $\lambda_A f_A/N_A$ and $\lambda_B f_B/N_B$ are
configurational factors related to the geometry of the molecules and of the lattice.
Since, moreover, only systems in which all lattice configurations have equal energy
are being considered, $\lambda_A f_A$ and $\lambda_B f_B$ may be expected to be independent of
temperature. It is accordingly convenient to replace (4-5) by the equivalent relation

\[ \rho\,\frac{\lambda_A f_A}{(\lambda_B f_B)^{r}} = \frac{N_A}{N_B}\left(\frac{qN_A+N_B}{N_B}\right)^{r-1}, \tag{4-6} \]

where $\rho$, like $C$ and $C'$, is independent of $N_A$ and $N_B$ but may depend on $r$, $q$ and $z$.
It may, however, be expected, and will later be verified, that $\rho$, unlike $C$ and $C'$,
is independent of the temperature.

The reader who is thoroughly conversant with the use of grand partition functions
and the part played in these by the absolute activities $\lambda$ could probably write down
(4-6) directly without need of the kinetic argument. He could perhaps also write
down the exact form of the constant $\rho$, but he would probably have difficulty in
giving a rigid proof. I therefore prefer to leave the form of $\rho$ indefinite at this stage.
All the required thermodynamic formulae can be derived from (4-6).

5. Gibbs function and free energy. According to the definitions of the absolute
activities $\lambda_A$, $\lambda_B$ and the partial potentials $\mu_A$, $\mu_B$ these are related to the Gibbs
function $G$ at constant temperature and pressure according to

\[ dG = \mu_A\,dN_A + \mu_B\,dN_B = kT\log\lambda_A\,dN_A + kT\log\lambda_B\,dN_B. \tag{5-1} \]

In order to make use of (4-6) in the integration of (5-1), one transforms the
independent variables from $N_A$, $N_B$ to $N_s$, $N_A$ using

\[ N_B = N_s - rN_A. \tag{5-2} \]

Differentiating (5-2) and substituting into (5-1), one obtains

\[ dG = (\mu_A - r\mu_B)\,dN_A + \mu_B\,dN_s, \tag{5-3} \]

so that

\[ \frac{1}{kT}\left(\frac{\partial G}{\partial N_A}\right)_{N_s} = \frac{\mu_A - r\mu_B}{kT} = \log\frac{\lambda_A}{\lambda_B^{\,r}}. \tag{5-4} \]

Substituting from (5-2) into (4-6) and using the result in (5-4), one obtains

\[ \frac{1}{kT}\left(\frac{\partial G}{\partial N_A}\right)_{N_s} = \log N_A - r\log(N_s - rN_A) + (r-1)\log(N_s - [r-q]N_A) - \log f_A + r\log f_B - \log\rho. \tag{5-5} \]

When (5-5) is integrated the integration constant is determined by the condition
that when $N_A$ is zero the formula for $G$ must reduce to

\[ \frac{G}{kT} = -N_s\log f_B \qquad (N_A = 0). \tag{5-6} \]


Integrating (5-5) subject to the condition (5-6) one obtains for the Gibbs function $G$
and the free energy $F$, assuming the pressure to be small,

\[ \frac{F}{kT} = \frac{G}{kT} = \log\frac{N_A!\,(N_s - rN_A)!}{N_s!} - \frac{r-1}{r-q}\log\frac{(N_s - [r-q]N_A)!}{N_s!} - N_A\log\rho - N_A\log f_A - (N_s - rN_A)\log f_B. \tag{5-7} \]

Now transforming back to the variables $N_A$, $N_B$ one obtains

\[ \frac{F}{kT} = \frac{G}{kT} = \log N_A! + \log N_B! - \frac{r-1}{r-q}\log(qN_A+N_B)! + \frac{q-1}{r-q}\log(rN_A+N_B)! - N_A\log\rho - N_A\log f_A - N_B\log f_B. \tag{5-8} \]

6. Partial potentials and vapour pressures. All the important thermodynamic
properties of the mixture can be obtained from (5-8) by differentiation. If the
subscript 0 is used to denote the value of a quantity for the substance A or B in its
pure state, then one obtains for the partial potentials $\mu_A$, $\mu_B$, the absolute
activities $\lambda_A$, $\lambda_B$ and the partial vapour pressures $p_A$, $p_B$ the formulae

\[ \exp\left(\frac{\mu_A - \mu_{A,0}}{kT}\right) = \frac{\lambda_A}{\lambda_{A,0}} = \frac{p_A}{p_{A,0}} = N_A\,\frac{(N_A + N_B/r)^{r(q-1)/(r-q)}}{(N_A + N_B/q)^{q(r-1)/(r-q)}}, \tag{6-1} \]

\[ \exp\left(\frac{\mu_B - \mu_{B,0}}{kT}\right) = \frac{\lambda_B}{\lambda_{B,0}} = \frac{p_B}{p_{B,0}} = N_B\,\frac{(rN_A + N_B)^{(q-1)/(r-q)}}{(qN_A + N_B)^{(r-1)/(r-q)}}. \tag{6-2} \]

It is readily verified that (6-1) and (6-2) satisfy (4-3) and the necessary relation

\[ \left(\frac{\partial\mu_A}{\partial N_B}\right)_{N_A} = \left(\frac{\partial\mu_B}{\partial N_A}\right)_{N_B}. \tag{6-3} \]
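The step from (5-8) to (6-1) can be checked numerically. The sketch below is an illustration under stated assumptions, not the author's procedure: it sets $\rho = f_A = f_B = 1$ (these constants cancel between mixture and pure substance), evaluates the factorials with `math.lgamma`, takes $q$ from $(r-1)/(r-q) = \tfrac12 z$, and replaces the derivative by a central difference. All function names are hypothetical.

```python
from math import lgamma, log

def contact_number(r, z):
    """q from (r - 1)/(r - q) = z/2, i.e. q = r - 2(r - 1)/z."""
    return r - 2.0 * (r - 1) / z

def F_over_kT(N_A, N_B, r, z):
    """Combinatorial part of (5-8), with rho = f_A = f_B = 1 (assumption)."""
    q = contact_number(r, z)
    log_fact = lambda n: lgamma(n + 1.0)   # log n! for real n
    return (log_fact(N_A) + log_fact(N_B)
            - 0.5 * z * log_fact(q * N_A + N_B)
            + (0.5 * z - 1.0) * log_fact(r * N_A + N_B))

def log_activity_A_numeric(N_A, N_B, r, z, h=1.0):
    """(mu_A - mu_A0)/kT by central differences of (5-8) in N_A."""
    def dF(nb):
        return (F_over_kT(N_A + h, nb, r, z) - F_over_kT(N_A - h, nb, r, z)) / (2.0 * h)
    return dF(N_B) - dF(0.0)               # mixture minus pure A

def log_activity_A_closed(N_A, N_B, r, z):
    """Logarithm of the right-hand side of (6-1)."""
    q = contact_number(r, z)
    return (log(N_A)
            + r * (0.5 * z - 1.0) * log(N_A + N_B / r)   # exponent r(q-1)/(r-q)
            - 0.5 * z * q * log(N_A + N_B / q))          # exponent q(r-1)/(r-q)
```

For macroscopic numbers of molecules the two functions should agree to within the small terms neglected by Stirling's approximation.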


When the value of $q$ given in (2-1) is substituted in (6-1) and (6-2) Miller's
formulae (29) and (30) of 1943 are obtained. In particular when $r = 2$, $q = 2(z-1)/z$,
the formulae (821,10) and (821,11) of Fowler & Guggenheim (1939) are obtained.
In making this substitution, it is convenient to note that from (2-1)

\[ \frac{r-1}{r-q} = \tfrac12 z, \qquad \frac{q-1}{r-q} = \tfrac12 z - 1. \tag{6-4} \]

The simplest form of the vapour-pressure formulae is obtained from (6-1), (6-2) by
using (6-4) where most convenient, but leaving $q$ and $r$ unchanged in other places.
Thus

\[ \frac{\lambda_A}{\lambda_{A,0}} = \frac{p_A}{p_{A,0}} = \frac{N_A}{N_A + N_B/r}\left(\frac{N_A + N_B/r}{N_A + N_B/q}\right)^{\frac12 zq}, \tag{6-5} \]

\[ \frac{\lambda_B}{\lambda_{B,0}} = \frac{p_B}{p_{B,0}} = \frac{N_B}{rN_A + N_B}\left(\frac{rN_A + N_B}{qN_A + N_B}\right)^{\frac12 z}. \tag{6-6} \]



In the special case of a linear array $z = 2$, $q = 1$ and formulae (6-5), (6-6) reduce
to Raoult's law:

\[ \frac{\lambda_A}{\lambda_{A,0}} = \frac{p_A}{p_{A,0}} = \frac{N_A}{N_A + N_B}, \tag{6-7} \]

\[ \frac{\lambda_B}{\lambda_{B,0}} = \frac{p_B}{p_{B,0}} = \frac{N_B}{N_A + N_B}. \tag{6-8} \]
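As a numerical illustration of (6-5) and (6-6) and of their reduction to Raoult's law, one may use a sketch such as the following. The function names are hypothetical, and $q$ is assumed to be fixed by $(r-1)/(r-q) = \tfrac12 z$ as in (6-4).

```python
def contact_number(r, z):
    """q from (6-4): (r - 1)/(r - q) = z/2, i.e. q = r - 2(r - 1)/z."""
    return r - 2.0 * (r - 1) / z

def activity_ratios(N_A, N_B, r, z):
    """Return (p_A/p_A0, p_B/p_B0) from formulae (6-5) and (6-6)."""
    q = contact_number(r, z)
    ratio_A = (N_A / (N_A + N_B / r)) * ((N_A + N_B / r) / (N_A + N_B / q)) ** (0.5 * z * q)
    ratio_B = (N_B / (r * N_A + N_B)) * ((r * N_A + N_B) / (q * N_A + N_B)) ** (0.5 * z)
    return ratio_A, ratio_B

# Linear array (z = 2): q = 1 whatever r, so the ratios are the mole fractions.
raoult_A, raoult_B = activity_ratios(3.0, 7.0, 5, 2)
```

With $z > 2$ and $r > 1$ the same call gives the departure from Raoult's law predicted by the lattice model.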


7. The constant $\rho$. It is noteworthy that it has not been necessary to evaluate
$\rho$ to determine how the partial vapour pressures depend on the composition of the
mixture. It was sufficient to know that $\rho$ is independent of $N_A$ and $N_B$. It is
nevertheless of some interest to determine $\rho$, since it is related to the number of
possible ways of arranging $N_A$ molecules A and $N_B$ molecules B on $N_s = rN_A + N_B$
sites. This number, which is denoted by $g_{r,1}(N_A, N_B)$, is related to the
thermodynamic functions by

\[ -\frac{F}{kT} = -\frac{G}{kT} = \log g_{r,1}(N_A, N_B) + N_A\log f_A + N_B\log f_B. \tag{7-1} \]

Comparing (7-1) with (5-8), one finds

\[ \log g_{r,1}(N_A, N_B) = -\log N_A! - \log N_B! + \frac{r-1}{r-q}\log(qN_A+N_B)! - \frac{q-1}{r-q}\log(rN_A+N_B)! + N_A\log\rho. \tag{7-2} \]

To determine the significance of $\rho$, consider the case $rN_A \ll N_B$ when, by neglecting
$(r-q)N_A$ compared with $N_s = rN_A + N_B$, (7-2) becomes asymptotically equivalent to

\[ \log g_{r,1}(N_A, N_B) = \log\frac{(rN_A+N_B)^{N_A}}{N_A!} + N_A\log\rho, \tag{7-3} \]

or

\[ g_{r,1}(N_A, N_B) = \frac{(rN_A+N_B)^{N_A}}{N_A!}\,\rho^{N_A}. \tag{7-4} \]

Now provided $rN_A \ll N_B$, the factor multiplying $\rho^{N_A}$ on the right of (7-4) is the
number of ways of choosing $N_A$ out of $rN_A + N_B$ sites. It may therefore be regarded
as the number of ways of choosing sites for one selected element in each of the $N_A$
molecules. Consequently $\rho$ must denote the number of distinct configurations of a
molecule A when one of its elements is fixed on a given site.
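The passage from (7-2) to (7-3) in the dilute limit can be illustrated numerically. The sketch below is only a check under assumed values; the function names, the sample numbers and the value chosen for $\rho$ are hypothetical. It evaluates $\log n!$ with `math.lgamma`, uses (6-4) to write the coefficients of (7-2) as $\tfrac12 z$ and $\tfrac12 z - 1$, and compares the two expressions when $rN_A \ll N_B$.

```python
from math import lgamma, log

def log_fact(n):
    """log n! for real n >= 0."""
    return lgamma(n + 1.0)

def log_g_full(N_A, N_B, r, z, rho):
    """Formula (7-2), with (r-1)/(r-q) = z/2 and (q-1)/(r-q) = z/2 - 1."""
    q = r - 2.0 * (r - 1) / z
    return (-log_fact(N_A) - log_fact(N_B)
            + 0.5 * z * log_fact(q * N_A + N_B)
            - (0.5 * z - 1.0) * log_fact(r * N_A + N_B)
            + N_A * log(rho))

def log_g_dilute(N_A, N_B, r, rho):
    """Asymptotic form (7-3), valid when r*N_A is much smaller than N_B."""
    return N_A * log(r * N_A + N_B) - log_fact(N_A) + N_A * log(rho)

# Assumed sample: 50 three-site molecules among 10^7 single-site molecules.
full = log_g_full(50.0, 1.0e7, 3, 6, 6.0)
dilute = log_g_dilute(50.0, 1.0e7, 3, 6.0)
```

The difference between the two values is of order $N_A^2/N_s$ and shrinks as the mixture becomes more dilute.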

Now that the physical significance of $\rho$ is known, it is possible to specify its value
in simple cases. Thus in the trivial case of molecules occupying a single site $r = 1$,
$\rho = 1$. For molecules occupying two sites $\rho = z/\sigma$, where $\sigma$ is the symmetry number
of the molecule. Substituting $z/\sigma$ for $\rho$ in (7-2) one recovers formula (821,3) of
Fowler & Guggenheim (1939), which in turn is equivalent to Chang's formula (33)
of 1939.

For molecules occupying three sites, there are three distinct cases. If the
molecules are linear and rigid $\rho = z/\sigma$. If the molecules are non-linear and rigid
$\rho = zz'/\sigma$, where $z'$ is the number of alternative sites for the third element when the
first two elements have been placed. If the molecules are flexible $\rho = z(z-1)/\sigma$.

For molecules occupying more than three sites there are numerous alternatives.
The minimum value for $\rho$ is $z/\sigma$ for rigid straight molecules. The maximum value
for $\rho$ is approximately $z(z-1)^{r-2}/\sigma$ for entirely flexible molecules, but this is slightly
inaccurate because it includes certain configurations in which the long molecule
bends back on itself and two elements occupy the same site; such configurations
should of course be excluded, but the error due to their inclusion is probably small.
Some of these values of $\rho$ are discussed by Miller (1942, p. 116), who also pointed out
that a knowledge of $\rho$ is not required for the determination of the dependence of the
partial pressures on the composition of the mixture.
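The limiting values of $\rho$ quoted above are simple enough to tabulate. The following sketch merely encodes the two bounds as stated in the text; the function names are hypothetical, and the flexible-chain value is the approximate upper bound, which slightly overcounts because of configurations bent back on themselves.

```python
def rho_rigid_straight(z, sigma):
    """Minimum value of rho for r >= 2: z/sigma for a rigid straight molecule."""
    return z / sigma

def rho_flexible_upper(z, r, sigma):
    """Approximate maximum of rho: z(z - 1)**(r - 2)/sigma for a fully flexible chain."""
    if r < 2:
        raise ValueError("the bound applies to molecules occupying at least two sites")
    return z * (z - 1) ** (r - 2) / sigma

# Example lattice with z = 6 and symmetry number 2 (assumed values).
two_site = rho_flexible_upper(6, 2, 2)      # coincides with the rigid value z/sigma
three_site = rho_flexible_upper(6, 3, 2)    # z(z - 1)/sigma
```

For two-site molecules both bounds coincide, in agreement with the single value $z/\sigma$ given in the text.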

In the values assigned to $\rho$ I have throughout included the factor $1/\sigma$ so as to
conform with the usage adopted by Chang and Miller. It is, however, a matter of
convention whether this factor $1/\sigma$ is included in $f$ or in $\rho$. It must be inserted in
one, but not in both.

8. Generalization. The method of approach described above is so simple that
it can be applied without any difficulty to the most general system containing any
number of different types of molecule each with its own geometric properties. Let
the molecules of type $i$ be $N_i$ in number; let each such molecule occupy $r_i$ sites; let
the number of sites which are neighbours of a molecule of type $i$ be $q_i z$; and let the
number of alternative orientations of a molecule of type $i$ be $\rho_i$ when one of its
elements has already been fixed. The $q_i$'s and $r_i$'s are related by

\[ \tfrac12 z(r_i - q_i) = r_i - 1 \qquad (\text{all } i). \tag{8-1} \]

Let the partition function of a molecule of type $i$, when attached to a prescribed
set of sites, be $f_i$. The procedure is so straightforward and so precisely analogous to
that already used, that the steps can be left to the reader and only the results need
be quoted. If $g(N_i)$ denotes the total number of possible arrangements of the
molecules, then

\[ \log g(N_i) = \tfrac12 z\log(\Sigma_i q_iN_i)! - \Sigma_i\log N_i! - (\tfrac12 z - 1)\log(\Sigma_i r_iN_i)! + \Sigma_i N_i\log\rho_i, \tag{8-2} \]

\[ \frac{F}{kT} = \frac{G}{kT} = -\log g(N_i) - \Sigma_i N_i\log f_i. \tag{8-3} \]

From this by differentiation with respect to $N_i$ one derives

\[ \rho_i f_i\lambda_i = N_i\,\frac{(\Sigma_j r_jN_j)^{(\frac12 z - 1)r_i}}{(\Sigma_j q_jN_j)^{\frac12 zq_i}}. \tag{8-4} \]

If in particular the molecules of type 1 occupy only one site, so that $r_1 = 1$, $q_1 = 1$,
$\rho_1 = 1$, then

\[ f_1\lambda_1 = N_1\,\frac{(\Sigma_j r_jN_j)^{\frac12 z - 1}}{(\Sigma_j q_jN_j)^{\frac12 z}}. \tag{8-5} \]

From (8-4) and (8-5) it follows that

\[ \frac{\rho_i f_i\lambda_i}{(f_1\lambda_1)^{r_i}} = \frac{N_i}{N_1}\left(\frac{\Sigma_j q_jN_j}{N_1}\right)^{r_i - 1}, \tag{8-6} \]

using (8-1). The relation (8-6) is the extension of (4-6) and can be derived by the
same elementary kinetic reasoning.

The general formula for the partial vapour pressures is

\[ \frac{p_i}{p_i^0} = \frac{r_iN_i}{\Sigma_j r_jN_j}\left(\frac{q_i\,\Sigma_j r_jN_j}{r_i\,\Sigma_j q_jN_j}\right)^{\frac12 zq_i}. \tag{8-7} \]

Formula (8-7) is the generalization of Raoult's law for molecules occupying various
numbers of sites.
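A short numerical sketch of the generalized law may be helpful. It assumes the form $p_i/p_i^0 = (r_iN_i/\Sigma_j r_jN_j)\{q_i\Sigma_j r_jN_j/(r_i\Sigma_j q_jN_j)\}^{\frac12 zq_i}$, obtained by dividing the activity of type $i$ in the mixture by its value in the pure substance, with each $q_i$ fixed by $\tfrac12 z(r_i - q_i) = r_i - 1$. The function names are hypothetical.

```python
def contact_numbers(r, z):
    """q_i from z(r_i - q_i)/2 = r_i - 1 for every molecular type."""
    return [ri - 2.0 * (ri - 1) / z for ri in r]

def pressure_ratios(N, r, z):
    """p_i / p_i^0 for each type i in a mixture with numbers N and sizes r."""
    q = contact_numbers(r, z)
    R = sum(ri * Ni for ri, Ni in zip(r, N))   # total number of sites
    Q = sum(qi * Ni for qi, Ni in zip(q, N))   # sum of q_j N_j
    return [(ri * Ni / R) * (qi * R / (ri * Q)) ** (0.5 * z * qi)
            for ri, qi, Ni in zip(r, q, N)]

# All molecules on single sites: the ratios are the mole fractions (Raoult's law).
single_site = pressure_ratios([2.0, 3.0, 5.0], [1, 1, 1], 8)

# Binary check against the single-site component of an r = 4, z = 6 mixture.
N_A, N_B, r_A, z = 3.0, 7.0, 4, 6
q_A = r_A - 2.0 * (r_A - 1) / z
binary_B = (N_B / (r_A * N_A + N_B)) * ((r_A * N_A + N_B) / (q_A * N_A + N_B)) ** (0.5 * z)
```

The same function accepts any number of molecular types, which is the point of the generalization.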

9. Degree of accuracy. As this paper is concerned only with systems with zero
energies of mixing, there is no error due to the assumption of complete randomness
of mixing. There is however a concealed error in my interpretation of complete
randomness in the neighbourhood of a site occupied in a specified manner. Consider,
for example, a site P neighbouring a site Q assumed to be occupied by an element
of a molecule A. Then in the argument I have assumed that all conceivable manners
of occupation of the site P, not prevented by the occupation of the site Q, are
equally probable. Actually I have committed an error in not excluding those
manners of occupation of the site P such that some element of the molecule
occupying the site P is competing for some site R with some element of the
molecule A assumed to be occupying the site Q. It is prohibitively difficult to
assess the seriousness of this inaccuracy, but I believe it to be small.

It should be noted that the whole treatment applies strictly to a well-defined
lattice, and it remains uncertain to what degree of accuracy the treatment may be
applied to liquid mixtures. There seems, however, no reason to doubt that for liquid
mixtures of molecules with the geometrical properties assumed formula (8-7) should
be more accurate than Raoult's law.

I am indebted to Sir Ralph Fowler, F.R.S., for helpful and constructive criticism. 


References 

Chang 1939a Proc. Roy. Soc. A, 169, 512.

Chang 1939b Proc. Camb. Phil. Soc. 35, 265.

Fowler & Guggenheim 1939 Statistical Thermodynamics. Cambridge University Press.

Fowler & Rushbrooke 1937 Trans. Faraday Soc. 33, 1272.

Guggenheim 1932 Proc. Roy. Soc. A, 135, 181.

Guggenheim 1933 Modern Thermodynamics, p. 103. Methuen.

Guggenheim 1937 Trans. Faraday Soc. 33, 151.

Hildebrand 1924 Solubility, p. 26. Chemical Catalog Co.

Lewis 1908 J. Amer. Chem. Soc. 30, 668.

Miller 1942 Proc. Camb. Phil. Soc. 38, 109.

Miller 1943 Proc. Camb. Phil. Soc. 39, 54.

Van't Hoff 1903 Vorlesungen über Theoretische und Physikalische Chemie, Part 2.

Washburn 1910 Z. phys. Chem. 74, 537.



Statistical thermodynamics of mixtures with non-zero 
energies of mixing 


By E. A. Guggenheim, M.A., Sc.D., Imperial College of Science. 
(Communicated by Sir Ralph Fowler , F.R.S.—Received 5 April 1944) 


A combinatory formula is obtained for $g(N_i, X_{ij})$, the number of ways of arranging a mixture
of any number of kinds of molecules on a lattice, the values of $N_i$ and $X_{ij}$ being specified,
where $N_i$ denotes the number of molecules of type $i$, $z$ denotes the number of sites which are
neighbours of one site, and $zX_{ij}$ denotes the number of pairs of neighbouring sites occupied one
by a molecule of type $i$, the other by a molecule of type $j$. Each molecule of type $i$ is assumed
to occupy $r_i$ sites, where $r_i$ is any integer with different values for different types of molecules.

This formula is used to derive the thermodynamic properties of mixtures of molecules
occupying various numbers of sites, assuming that the intermolecular energy can be regarded
as a sum of terms, each pair of neighbours contributing one term.

For binary mixtures the formulae obtained are very similar to those previously obtained
for ‘regular’ solutions where each molecule occupies one site. A rather simple formula is
obtained for the critical temperature and the composition of the critical mixture.

The degree of accuracy of the treatment is the same as Chang's use of Bethe's first
approximation and as the ‘quasi-chemical’ method of approach. A brief investigation of a higher
approximation for a binary regular mixture on a close-packed lattice indicates that the errors
due to the approximation used are unlikely ever to be serious.

1. Introduction. In a former paper (1944) I derived a general formula for $g(N_i)$,
the total number of distinguishable ways of arranging on a lattice a mixture of any
number of kinds of molecules differing amongst themselves in the number of sites
occupied by one molecule, the number of molecules of type $i$ being $N_i$. From this
I deduced the thermodynamic properties of such a mixture subject to the restriction
that all the configurations had the same energy, or in other words that the energies
of mixing were all zero.

The thermodynamic properties of such a mixture having non-zero energies of
mixing are not determinable from $g(N_i)$, but from the more complicated quantity
$g(N_i, W)$ denoting the number of distinguishable configurations of intermolecular
energy $W$. The configurational part of the free energy is equal to

\[ -kT\log\Sigma\{g(N_i, W)\exp(-W/kT)\}, \]

where the summation extends over all values of $W$. As usual, for macroscopic
systems there is no sensible loss of accuracy if the sum $\Sigma\{\ \}$ is replaced by its
maximum term, say $g(N_i, \bar W)\exp(-\bar W/kT)$, where $\bar W$ is the value of $W$ which maximizes
$g(N_i, W)\exp(-W/kT)$ and is the configurational thermodynamic energy of the
system in equilibrium.

The evaluation of $g(N_i, W)$ by direct methods is prohibitively difficult, but the
following considerations are sufficient to determine unambiguously the form of
$g(N_i, W)$.



In the first place $\{\log g(N_i, W)\}/\Sigma_i N_i$ must be homogeneous of degree zero in the
$N_i$'s, so as to ensure that the free energy of a system of given temperature and
composition shall be proportional to the size of the system.

In the second place when $W$ is zero $\log g(N_i, W)$ must, apart from terms negligible
compared with $N_i$, reduce to $\log g(N_i)$ so as to ensure that the free energy is a
continuous function of $W$ in the neighbourhood of $W = 0$.

In the third place I shall assume that the conditions for maximizing

\[ g(N_i, W)\exp(-W/kT) \]

with respect to $W$ are equivalent to conditions of a ‘quasi-chemical equilibrium’
(Guggenheim 1938) between the alternative methods of occupation of a pair of
neighbouring sites. This is an assumption which has proved fruitful in the simple
case where each molecule of whatever type occupies a single site (Guggenheim
1935; Rushbrooke 1938). In order not to interrupt the argument this assumption
will be used at an early stage, discussion of its significance being postponed to §§ 12-14.

Having obtained a formula for $g(N_i, W)$ and having maximized

\[ g(N_i, W)\exp(-W/kT) \]

with respect to $W$, the thermodynamic properties of the system are formally
determined. The equations of quasi-chemical equilibrium for maximizing with respect
to $W$ can, however, be solved explicitly only for binary mixtures. For these the
thermodynamic functions can be expressed in a form having a close resemblance to
those previously obtained for ‘regular’ binary mixtures of molecules each occupying
a single site. The formulae for binary mixtures include as a special case those given
recently by Orr (1944) for a mixture of two kinds of molecule, of which those of one
kind occupy only one site each. The present work was in fact stimulated by Orr's
paper, which I saw before its publication.

2. Notation and counting operations. Let $N_i$ denote the number of molecules of
type $i$. Let each molecule of type $i$ occupy $r_i$ sites. Let the number of sites which
are neighbours of any one site be $z$. It will be assumed that the structures of all the
molecules are such that the elements which occupy a single site are interconnected
in the molecule as a simple chain or a branched chain, but that there are no closed
rings in the molecule. It is convenient to define parameters $q_i$, one for each
molecular type, by the relation

\[ z(r_i - q_i) = 2(r_i - 1) \qquad (\text{all } i). \tag{2-1} \]

The physical meaning of $q_i$ is that $q_i z$ is the number of sites neighbours of the $r_i$
sites occupied by a molecule of type $i$, excluding those neighbours of each site
occupied by the next element of the same molecule. Apart from configurations of a long
molecule bent back on itself so that ‘distant’ elements of the same molecule occupy
neighbouring sites, the definition of $q_i$ could be simplified as follows: $q_i z$ is the number
of sites which are neighbours of a molecule of type $i$.

Let $f_i$ denote the partition function, omitting symmetry numbers, of a molecule
of type $i$ attached to a given set of sites, $f_i$ being assumed independent of this set of
sites. This assumption is accurate provided each possible internal rotation in a
molecule is either unrestricted or completely restricted.

Let $\sigma_i$ denote the symmetry number of a molecule of type $i$. Let $\rho_i$ denote the
number of alternative configurations of a molecule of type $i$ when one of its elements
is fixed on a given site. When $r_i = 1$ the value of $\rho_i$ is 1. When $r_i \ge 2$ the minimum
value of $\rho_i$ is $z/\sigma_i$ for rigid straight molecules. The maximum value of $\rho_i$ is nearly
$z(z-1)^{r_i-2}/\sigma_i$ for completely flexible molecules. The last mentioned value of $\rho_i$ is
actually somewhat greater than the maximum possible because it includes certain
configurations in which a molecule is bent back on itself so that two ‘distant’ elements
occupy the same site; such configurations should of course be excluded.

It is assumed that in any given configuration of the system the intermolecular
potential energy $W$ consists of a sum of terms, each pair of neighbouring sites
occupied by elements of different molecules contributing one term. Let $w_{ij}$ denote
the contribution to the potential energy by a pair of sites of which one is occupied
by an element of a molecule of type $i$, the other by an element of a different molecule
of type $j$. In order to achieve brevity in the formulae the energy zeros are so chosen
that $w_{ii} = 0$ for all $i$. If it is wished to remove this restriction, it is merely necessary
to replace $w_{ij}$ in all formulae by $w_{ij} - \tfrac12 w_{ii} - \tfrac12 w_{jj}$.

Let the number of pairs of neighbouring sites occupied by different molecules of
which one is of type $i$, the other of type $j$ be denoted by $zX_{ij}$. Then according to
these definitions

\[ 2zX_{ii} + \Sigma'_k\,zX_{ik} = zq_iN_i, \tag{2-2} \]

where $\Sigma'_k$ denotes summation over all $k \ne i$. A more convenient form of (2-2) is

\[ 2X_{ii} = q_iN_i - \Sigma'_k X_{ik}. \tag{2-3} \]

The total intermolecular potential energy $W$ of the system is then given by

\[ W = z\,\Sigma'_{ij} X_{ij}w_{ij}, \tag{2-4} \]

where $\Sigma'_{ij}$ denotes summation over all distinct pairs of types of molecules.

3. Quasi-chemical equilibrium. If one uses the symbol $(ij)$ to denote a pair of
sites occupied by elements of two different molecules, one of type $i$ the other of
type $j$, and considers a shuffling of the molecules such that two $(ij)$'s are created and
an $(ii)$ and a $(jj)$ are destroyed, one can write this symbolically

\[ (ii) + (jj) \rightarrow 2(ij). \tag{3-1} \]

Formula (3-1) recalls the formula for a chemical reaction. It is not unreasonable to
expect that the equilibrium condition between $(ii)$, $(jj)$ and $(ij)$ should take a form
similar to that corresponding to a chemical reaction. If (3-1) represented a chemical
reaction between molecules, the equilibrium condition would be

\[ \frac{X_{ij}^2}{X_{ii}X_{jj}} = 4e^{-2w_{ij}/kT}, \tag{3-2} \]

where the molecular partition functions of $ii$, $jj$ and $ij$ are assumed to be identical
except for symmetry factors of $\tfrac12$ in the first two. If a formula such as (3-2) is applied
to the number of pairs of neighbouring sites occupied in several ways it is
conveniently referred to as a condition of ‘quasi-chemical’ equilibrium.

An equation of quasi-chemical equilibrium similar to (3-2) was first proposed for
‘regular’ mixtures of molecules each occupying a single site (Guggenheim 1935).
An error in this first use of the equation was pointed out by Rushbrooke (1938),
who at the same time showed that the equation could be derived from the approximate
formulation of grand partition functions usually referred to as Bethe's method.
It was later shown that the equivalence between the equations of quasi-chemical
equilibrium and those of Bethe's method was quite general for molecules or atoms
each occupying a single site, not only for all ‘regular’ systems (Guggenheim 1938),
but also for systems with long-range order (Fowler & Guggenheim 1940). It is
therefore not surprising to find that the two methods are also equivalent for systems
of molecules each occupying several sites. To avoid a long digression in the reasoning,
discussion of this equivalence is postponed to a later stage in § 12.

It is convenient to substitute from (2-3) and a similar formula for $X_{jj}$ into (3-2),
thus obtaining for the conditions of quasi-chemical equilibrium

\[ \frac{X_{ij}^2}{(q_iN_i - \Sigma'_k X_{ik})(q_jN_j - \Sigma'_k X_{jk})} = e^{-2w_{ij}/kT}. \tag{3-3} \]

If one denotes by $X^*_{ij}$ the equilibrium values of $X_{ij}$ in the particular case that all
energies of mixing are zero, that is to say all $w_{ij} = 0$, then

\[ X^{*2}_{ij} = (q_iN_i - \Sigma'_k X^*_{ik})(q_jN_j - \Sigma'_k X^*_{jk}) \qquad (i \ne j). \tag{3-4} \]

The relevant solution of (3-4) is

\[ X^*_{ij} = \frac{q_iN_i\,q_jN_j}{\Sigma_k q_kN_k} \qquad (i \ne j), \tag{3-5} \]

\[ X^*_{ii} = \frac{(q_iN_i)^2}{2\,\Sigma_k q_kN_k}. \tag{3-6} \]
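That the random-mixing values do satisfy both the counting relation (2-3) and the zero-energy equilibrium condition (3-4) can be confirmed with exact arithmetic. The sketch below assumes the forms $X^*_{ij} = q_iN_iq_jN_j/\Sigma_k q_kN_k$ for $i \ne j$ and $X^*_{ii} = (q_iN_i)^2/(2\Sigma_k q_kN_k)$; the function name and the three-component sample are hypothetical.

```python
from fractions import Fraction

def zero_mixing_pairs(N, q):
    """Return (a, X) with a_i = q_i N_i and X[i][j] the zero-energy pair numbers."""
    a = [Fraction(qi) * Fraction(Ni) for qi, Ni in zip(q, N)]
    Q = sum(a)                                   # sum over k of q_k N_k
    n = len(a)
    X = [[a[i] * a[j] / Q for j in range(n)] for i in range(n)]   # unlike pairs
    for i in range(n):
        X[i][i] = a[i] ** 2 / (2 * Q)            # like pairs carry the factor 1/2
    return a, X

# Assumed sample: three molecular types with q = 7/3, 1 and 5/3.
a, X = zero_mixing_pairs([10, 20, 30], [Fraction(7, 3), 1, Fraction(5, 3)])
n = len(a)
counting_ok = all(
    2 * X[i][i] == a[i] - sum(X[i][k] for k in range(n) if k != i)
    for i in range(n)
)
equilibrium_ok = all(
    X[i][j] ** 2 == (2 * X[i][i]) * (2 * X[j][j])
    for i in range(n) for j in range(n) if i != j
)
```

Both flags come out true for any positive $q_iN_i$, since the checks are algebraic identities in these quantities.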


4. Combinatory factor $g(N_i)$. The total number $g(N_i)$ of distinguishable ways of
arranging all the molecules, $N_i$ of each species $i$, on $\Sigma_i r_iN_i$ sites has been shown in
the previous paper (1944) to be given by

\[ \log g(N_i) = \tfrac12 z\log(\Sigma_i q_iN_i)! - (\tfrac12 z - 1)\log(\Sigma_i r_iN_i)! - \Sigma_i\log N_i! + \Sigma_i N_i\log\rho_i, \tag{4-1} \]

where each $q_i$ is related to the corresponding $r_i$ by (2-1).

5. Combinatory factor $g(N_i, X_{ij})$. It is now possible to construct an expression for
the number of distinguishable configurations of all the molecules having a total
intermolecular energy $W$. In the introduction I called this number $g(N_i, W)$, but
$W$ is related to the $X_{ij}$'s according to (2-4), and it will be more convenient to consider
this number as a function of the $N_i$'s and the $X_{ij}$'s. I shall accordingly henceforth
call it $g(N_i, X_{ij})$.



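The formula for $g(N_i, X_{ij})$ determined by the considerations enumerated in § 1 is

\[ \log g(N_i, X_{ij}) = \log g(N_i) + \tfrac12 z\,\Sigma_i\log(q_iN_i - \Sigma'_k X^*_{ik})! + z\,\Sigma'_{ij}\log X^*_{ij}! - \tfrac12 z\,\Sigma_i\log(q_iN_i - \Sigma'_k X_{ik})! - z\,\Sigma'_{ij}\log X_{ij}!, \tag{5-1} \]

where $g(N_i)$ is given by (4-1). It can now be verified that $g(N_i, X_{ij})$ as defined by (5-1)
does in fact have the properties enumerated in § 1.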

In the first place if one maximizes

\[ \log\{g(N_i, X_{ij})\,e^{-W/kT}\} = \log g(N_i, X_{ij}) - W/kT \tag{5-2} \]

with respect to $X_{ij}$, the condition obtained is

\[ \tfrac12 z\log(q_iN_i - \Sigma'_k X_{ik}) + \tfrac12 z\log(q_jN_j - \Sigma'_k X_{jk}) - z\log X_{ij} - zw_{ij}/kT = 0 \qquad (i \ne j), \tag{5-3} \]

or

\[ \frac{X_{ij}^2}{(q_iN_i - \Sigma'_k X_{ik})(q_jN_j - \Sigma'_k X_{jk})} = e^{-2w_{ij}/kT} \qquad (i \ne j), \tag{5-4} \]

which is identical with (3-3). This confirms that the maximizing conditions when
$g(N_i, X_{ij})$ is given by (5-1) are of the required quasi-chemical form.

In the second place it is seen from (5-4) that the $X_{ij}$'s are homogeneous of the first
degree in the $N_i$'s, from which it follows that $\{\log g(N_i, X_{ij})\}/\Sigma_i N_i$ as given by (5-1)
is homogeneous of zero degree in the $N_i$'s as required.

Finally, from the definition of the $X^*_{ij}$'s it is evident that in the special case that
all $w_{ij} = 0$, that is to say all energies of mixing zero, $g(N_i, X_{ij})$ reduces to $g(N_i)$. This
completes the justification of formula (5-1) for $g(N_i, X_{ij})$.

When the values of $X^*_{ij}$ given by (3-5) and the value of $g(N_i)$ given by (4-1) are
substituted into (5-1), this can be rewritten as

\[ \log g(N_i, X_{ij}) = \Sigma_i N_i\log\rho_i - \Sigma_i\log N_i! + z\,\Sigma_i\log(q_iN_i)! - (\tfrac12 z - 1)\log(\Sigma_i r_iN_i)! - \tfrac12 z\,\Sigma_i\log(q_iN_i - \Sigma'_k X_{ik})! - z\,\Sigma'_{ij}\log X_{ij}!. \tag{5-5} \]

It is worthy of notice that the term $\tfrac12 z\log(\Sigma_i q_iN_i)!$ in $\log g(N_i)$ is cancelled by
an equal but opposite term coming from the denominators of (3-5) and (3-6).
Consequently $g(N_i, X_{ij})$ contains no term in $\log(\Sigma_i q_iN_i)!$. Formula (5-5) is the
fundamental one in the present treatment.

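6. Thermodynamic functions. The free energy of the system is, at negligible
pressures, related to $g(N_i, X_{ij})$ by

\[ F = -kT\,\Sigma_i N_i\log f_i - kT\log g(N_i, \bar X_{ij}) + \bar W = -kT\,\Sigma_i N_i\log f_i - kT\log g(N_i, \bar X_{ij}) + z\,\Sigma'_{ij}\bar X_{ij}w_{ij}, \tag{6-1} \]

where $\bar X_{ij}$ denotes the equilibrium value of $X_{ij}$ determined by equations (3-3).
Substituting from (5-5) into (6-1), one obtains

\[ F/kT = -\Sigma_i N_i\log f_i\rho_i + \Sigma_i\log N_i! - z\,\Sigma_i\log(q_iN_i)! + (\tfrac12 z - 1)\log(\Sigma_i r_iN_i)! + \tfrac12 z\,\Sigma_i\log(q_iN_i - \Sigma'_k\bar X_{ik})! + z\,\Sigma'_{ij}\log\bar X_{ij}! + z\,\Sigma'_{ij}\bar X_{ij}w_{ij}/kT. \tag{6-2} \]

The partial potentials $\mu_i$ and the absolute activities $\lambda_i$ (Fowler & Guggenheim 1939,
§ 224) can be obtained by differentiating (6-2) with respect to $N_i$, and in so doing
variations in $\bar X_{ij}$ may be ignored since it is known that $\partial F/\partial\bar X_{ij} = 0$ for all $i, j$.
It is thus found that

\[ \mu_i/kT = \log\lambda_i = -\log f_i\rho_i + \log N_i - zq_i\log(q_iN_i) + (\tfrac12 z - 1)r_i\log\Sigma_j r_jN_j + \tfrac12 zq_i\log(q_iN_i - \Sigma'_k\bar X_{ik}), \tag{6-3} \]

or

\[ \lambda_i = \frac{N_i}{f_i\rho_i}\,\frac{(\Sigma_j r_jN_j)^{(\frac12 z - 1)r_i}\,(q_iN_i - \Sigma'_k\bar X_{ik})^{\frac12 zq_i}}{(q_iN_i)^{zq_i}}. \tag{6-4} \]

This can be rewritten as

\[ \lambda_i = \frac{N_i}{f_i\rho_i}\,\frac{(\Sigma_j r_jN_j)^{(\frac12 z - 1)r_i}\,(2\bar X_{ii})^{\frac12 zq_i}}{(q_iN_i)^{zq_i}} \tag{6-5} \]

by using (2-3). Finally, by using (2-1) the alternative formula

\[ \lambda_i = \frac{N_i}{f_i\rho_i\,\Sigma_j r_jN_j}\left\{\frac{(\Sigma_j r_jN_j)\,2\bar X_{ii}}{(q_iN_i)^2}\right\}^{\frac12 zq_i} \tag{6-6} \]

is obtained.

In the particular case of all energies of mixing zero, $\bar X_{ii}$ becomes $X^*_{ii}$, and using
(3-6) in (6-6) one obtains

\[ \lambda_i = \frac{N_i}{f_i\rho_i\,\Sigma_j r_jN_j}\left\{\frac{\Sigma_j r_jN_j}{\Sigma_j q_jN_j}\right\}^{\frac12 zq_i}, \tag{6-7} \]

which is formula (8-4) of my previous paper (1944b).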

7. Partial vapour pressures. From (6-6) one deduces the relation between the
partial vapour pressure $p_i$ of the component $i$ in the mixture and the vapour pressure
$p_i^0$ of the pure substance

\[ \frac{p_i}{p_i^0} = \frac{r_iN_i}{\Sigma_j r_jN_j}\left\{\frac{q_i\,(\Sigma_j r_jN_j)\,2\bar X_{ii}}{r_i\,(q_iN_i)^2}\right\}^{\frac12 zq_i}. \tag{7-1} \]

In this formula and in the formulae of the preceding section the $\bar X_{ik}$'s have the
values determined by the quasi-chemical equations (3-3). Since, however, these
equations can generally not be solved explicitly, no further simplification is possible
except in the case of a binary mixture, which will be considered in greater detail.

8. Combinatory formula for binary mixtures. When there are only two types of
molecules, say A and B, in the mixture, formula (5-5) becomes

\[ g(N_A, N_B, X) = \frac{\rho_A^{N_A}\,\rho_B^{N_B}\,\{(q_AN_A)!\}^{z}\,\{(q_BN_B)!\}^{z}}{N_A!\,N_B!\,\{(r_AN_A + r_BN_B)!\}^{\frac12 z - 1}\,\{(q_AN_A - X)!\}^{\frac12 z}\,\{X!\}^{z}\,\{(q_BN_B - X)!\}^{\frac12 z}}, \tag{8-1} \]

where $X$ has been written for $X_{AB}$.

Formulae equivalent to two special cases of (8-1) have previously been obtained
by Chang (1939). The first case was the simplest possible one,

\[ r_A = r_B = q_A = q_B = \rho_A = \rho_B = 1. \]

The second case treated by Chang was

\[ r_A = 2, \quad q_A = 2(z-1)/z, \quad \rho_A = z/2, \quad r_B = q_B = \rho_B = 1. \]

In the case of a binary mixture there is only one equation of quasi-chemical
equilibrium, namely,

\[ X^2 = (q_AN_A - X)(q_BN_B - X)\,e^{-2w/kT}, \tag{8-2} \]

where $w$ has been written for $w_{AB}$. It is convenient to define a fraction $x$ by

\[ x = \frac{q_AN_A}{q_AN_A + q_BN_B}, \qquad 1 - x = \frac{q_BN_B}{q_AN_A + q_BN_B}, \tag{8-3} \]

and a quantity $\beta$ by

\[ \beta = \{(1-2x)^2 + 4x(1-x)\,e^{2w/kT}\}^{\frac12}. \tag{8-4} \]

It is to be noted that in the simplest case of ‘regular’ solutions $q_A = q_B = 1$ and $x$ is the
mole-fraction of A.

The solution $X$ of the quadratic equation (8-2) is conveniently written in the form

\[ \frac{X}{q_AN_A + q_BN_B} = \frac{2x(1-x)}{\beta + 1}, \tag{8-5} \]

from which follow

\[ \frac{q_AN_A - X}{q_AN_A + q_BN_B} = \frac{x(\beta - 1 + 2x)}{\beta + 1}, \tag{8-6} \]

\[ \frac{q_BN_B - X}{q_AN_A + q_BN_B} = \frac{(1-x)(\beta + 1 - 2x)}{\beta + 1}. \tag{8-7} \]
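The closed-form solution of the quasi-chemical quadratic is easily checked by substitution. The sketch below assumes the forms $X^2 = (q_AN_A - X)(q_BN_B - X)e^{-2w/kT}$, $\beta = \{(1-2x)^2 + 4x(1-x)e^{2w/kT}\}^{1/2}$ and $X/(q_AN_A + q_BN_B) = 2x(1-x)/(\beta+1)$; the function name and numerical values are hypothetical.

```python
from math import exp, sqrt

def quasi_chemical_pairs(qN_A, qN_B, w_over_kT):
    """Return (X, beta) for a binary mixture, given q_A N_A, q_B N_B and w/kT."""
    Q = qN_A + qN_B
    x = qN_A / Q                                            # fraction x of (8-3)
    beta = sqrt((1.0 - 2.0 * x) ** 2
                + 4.0 * x * (1.0 - x) * exp(2.0 * w_over_kT))   # (8-4)
    X = Q * 2.0 * x * (1.0 - x) / (beta + 1.0)              # (8-5)
    return X, beta

# Assumed sample: q_A N_A = 70, q_B N_B = 30, w/kT = 0.4.
X, beta = quasi_chemical_pairs(70.0, 30.0, 0.4)
residual = X ** 2 - (70.0 - X) * (30.0 - X) * exp(-0.8)     # should vanish, eq. (8-2)

# Zero energy of mixing: beta = 1 and X is the random value q_A N_A q_B N_B / sum.
X0, beta0 = quasi_chemical_pairs(70.0, 30.0, 0.0)
```

A positive $w$ lowers $X$ below its random value, as expected when unlike contacts are energetically unfavourable.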

9. Thermodynamic functions for binary mixtures. The neatest, if not the most
direct, way to obtain a compact formula for the free energy is via the absolute
activities $\lambda_A$ and $\lambda_B$. For a binary mixture formula (6-6) becomes

\[ \lambda_A = \frac{N_A}{f_A\rho_A(r_AN_A + r_BN_B)}\left\{\frac{(r_AN_A + r_BN_B)(q_AN_A - X)}{(q_AN_A)^2}\right\}^{\frac12 zq_A}, \tag{9-1} \]

\[ \lambda_B = \frac{N_B}{f_B\rho_B(r_AN_A + r_BN_B)}\left\{\frac{(r_AN_A + r_BN_B)(q_BN_B - X)}{(q_BN_B)^2}\right\}^{\frac12 zq_B}. \tag{9-2} \]

Substituting from (8-6) and (8-7) into (9-1) and (9-2) respectively, one obtains

\[ \lambda_A = \frac{N_A}{f_A\rho_A(r_AN_A + r_BN_B)}\left\{\frac{(r_AN_A + r_BN_B)(\beta - 1 + 2x)}{(q_AN_A + q_BN_B)(\beta + 1)x}\right\}^{\frac12 zq_A}, \tag{9-3} \]

\[ \lambda_B = \frac{N_B}{f_B\rho_B(r_AN_A + r_BN_B)}\left\{\frac{(r_AN_A + r_BN_B)(\beta + 1 - 2x)}{(q_AN_A + q_BN_B)(\beta + 1)(1-x)}\right\}^{\frac12 zq_B}. \tag{9-4} \]

If the subscript 0 is used to denote the value of a quantity for the substance in
the pure state, (9-3) and (9-4) can be rewritten as

\[ \frac{\lambda_A}{\lambda_{A,0}} = \frac{p_A}{p_{A,0}} = \frac{N_A}{N_A + r_BN_B/r_A}\left\{\frac{(N_A + r_BN_B/r_A)(\beta - 1 + 2x)}{(N_A + q_BN_B/q_A)(\beta + 1)x}\right\}^{\frac12 zq_A}, \tag{9-5} \]

\[ \frac{\lambda_B}{\lambda_{B,0}} = \frac{p_B}{p_{B,0}} = \frac{N_B}{r_AN_A/r_B + N_B}\left\{\frac{(r_AN_A/r_B + N_B)(\beta + 1 - 2x)}{(q_AN_A/q_B + N_B)(\beta + 1)(1-x)}\right\}^{\frac12 zq_B}. \tag{9-6} \]




In the simplest case $r_A = r_B = q_A = q_B = 1$, these formulae reduce to those given
by Fowler & Guggenheim (1939, § 819). The formulae for the case $r_B = q_B = 1$,
$r_A > 1$ have been derived in a recent paper by Orr (1944).

The free energy at negligible pressures is obtained conveniently from the
thermodynamic relation

$$F/kT = N_A \log \lambda_A + N_B \log \lambda_B. \tag{9.7}$$

The result can be written

$$F = F_1 + F_2, \tag{9.8}$$

where

$$\begin{aligned} F_1/kT &= -\log g(N_A, N_B) - N_A \log f_A - N_B \log f_B \\ &= -N_A \log f_A \rho_A - N_B \log f_B \rho_B + \log N_A! + \log N_B! \\ &\quad - \tfrac12 z \log (q_A N_A + q_B N_B)! + (\tfrac12 z - 1) \log (r_A N_A + r_B N_B)! \end{aligned} \tag{9.9}$$

and

$$F_2/kT = \tfrac12 z (q_A N_A + q_B N_B) \left\{ x \log \frac{\beta - 1 + 2x}{x(\beta + 1)} + (1 - x) \log \frac{\beta + 1 - 2x}{(1 - x)(\beta + 1)} \right\}. \tag{9.10}$$

In the case of zero energy of mixing $\beta = 1$ and $F_2$ vanishes, whereas $F_1$ is independent
of the energy of mixing. Thus $F_2$ may be regarded as the part of the free energy due
to the energy of mixing. The same formula for the free energy can be obtained,
though more laboriously, by using (8.1) and (8.5) in

$$F/kT = -N_A \log f_A - N_B \log f_B - \log g(N_A, N_B, X) + zXw/kT. \tag{9.11}$$


10. Energy of mixing. Having obtained formulae for the free energy, one can
derive the energy according to the usual thermodynamic formulae by differentiation
of $F/T$ with respect to $1/T$. The only terms in $F_1/T$ which depend on the temperature
are $-N_A \log f_A - N_B \log f_B$, and the contribution from these terms to the energy
remains unchanged if the mixture is separated into its components. Hence the
energy of mixing $E$ is due entirely to $F_2/T$ and is given by

$$E = \frac{1}{T} \frac{\partial F_2}{\partial \beta} \frac{d\beta}{d(1/T)}. \tag{10.1}$$

By straightforward, if lengthy, algebra it is found that

$$\frac{1}{kT} \frac{\partial F_2}{\partial \beta} = 2z (q_A N_A + q_B N_B) \frac{x(1 - x)\,\beta}{(\beta + 1)\{\beta^2 - (1 - 2x)^2\}}, \tag{10.2}$$

$$\frac{1}{T} \frac{d\beta}{d(1/T)} = \frac{w}{kT} \frac{\beta^2 - (1 - 2x)^2}{\beta}, \tag{10.3}$$

whence

$$E = zw (q_A N_A + q_B N_B) \frac{2x(1 - x)}{\beta + 1}. \tag{10.4}$$

Comparing (10.4) with (8.5), one finds

$$E = zXw = W, \tag{10.5}$$

as might be expected. This circuitous derivation of the physically obvious (10.5)
is a useful check on the self-consistency of the formulae.
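The self-consistency check described in this section can also be carried out numerically. The sketch below is an illustration rather than part of the original paper: it differentiates $F_2/T$ of (9.10) with respect to $1/T$ by a central finite difference and compares the result with the closed form (10.4). It assumes units in which $k = 1$ and works per unit of $q_A N_A + q_B N_B$; the function names and the step size `h` are arbitrary choices.

```python
import math


def beta(x, w, inv_T):
    """beta of (8.4), written as a function of 1/T (units with k = 1)."""
    return math.sqrt((1 - 2 * x) ** 2 + 4 * x * (1 - x) * math.exp(2 * w * inv_T))


def f2_over_T(x, w, inv_T, z):
    """F_2/T from (9.10), per unit of q_A N_A + q_B N_B, with k = 1."""
    b = beta(x, w, inv_T)
    term_a = x * math.log((b - 1 + 2 * x) / (x * (b + 1)))
    term_b = (1 - x) * math.log((b + 1 - 2 * x) / ((1 - x) * (b + 1)))
    return 0.5 * z * (term_a + term_b)


def energy_by_differentiation(x, w, inv_T, z, h=1e-6):
    """Energy of mixing as d(F_2/T)/d(1/T), by a central difference."""
    return (f2_over_T(x, w, inv_T + h, z) - f2_over_T(x, w, inv_T - h, z)) / (2 * h)


def energy_closed_form(x, w, inv_T, z):
    """Energy of mixing from (10.4), i.e. z X w with X taken from (8.5)."""
    return z * w * 2 * x * (1 - x) / (beta(x, w, inv_T) + 1)
```

Agreement of the two functions to within the finite-difference error reproduces, for particular numbers, the statement $E = zXw$ of (10.5).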



11. Critical mixing. A necessary condition for the internal stability of a mixture
is that

$$\frac{1}{kT} \frac{\partial \mu_A}{\partial x} = \frac{\partial \log \lambda_A}{\partial x} > 0. \tag{11.1}$$

If at a given temperature this inequality is not satisfied by all mixtures, then some
mixtures will be unstable and will split into two phases of different composition.

Differentiating (9.3) with respect to $x$, using (8.4), one obtains an expression which
eventually reduces to

$$\frac{\partial \log \lambda_A}{\partial x} = \frac{1}{2x} \left\{ \frac{z q_A}{\beta} - \frac{(z - 2)\, r_A r_B / q_B}{r_A x / q_A + r_B (1 - x) / q_B} \right\}. \tag{11.2}$$
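The intermediate steps behind this reduction are not given in the text. One possible route, offered here as a reconstruction from (8.3), (8.4) and (9.3) rather than as the author's own working, uses $N_A \propto x/q_A$, $N_B \propto (1-x)/q_B$ and the identity $e^{2w/kT} - 1 = (\beta^2 - 1)/4x(1-x)$, which follows from (8.4):

```latex
\begin{align*}
\frac{\partial \beta}{\partial x}
  &= \frac{(1 - 2x)(\beta^2 - 1)}{2 \beta x (1 - x)}, \\
\frac{\partial}{\partial x} \log \frac{\beta - 1 + 2x}{\beta + 1}
  &= \frac{1}{\beta x}, \\
\frac{\partial \log \lambda_A}{\partial x}
  &= \left(\tfrac12 z q_A - 1\right)
     \left\{ \frac{r_A / q_A - r_B / q_B}{r_A x / q_A + r_B (1 - x) / q_B}
             - \frac{1}{x} \right\}
     + \frac{z q_A}{2 \beta x}.
\end{align*}
```

With $\tfrac12 z q_A - 1 = \tfrac12 (z - 2) r_A$, the relation between $q_A$ and $r_A$ implied by the two forms of the constant $a$ defined in this section, the bracket collapses to $-(r_B/q_B)/[x\{r_A x/q_A + r_B(1-x)/q_B\}]$ and the stated expression for $\partial \log \lambda_A / \partial x$ follows.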

For the sake of brevity two new constants $a$ and $b$ are now defined by

$$a = \frac{z q_A}{(z - 2)\, r_A} = \frac{z q_A}{z q_A - 2}, \tag{11.3}$$

$$b = \frac{z q_B}{(z - 2)\, r_B} = \frac{z q_B}{z q_B - 2}. \tag{11.4}$$

From (11.2) it follows that the equation

$$\partial \log \lambda_A / \partial x = 0 \tag{11.5}$$

reduces to

$$\beta = bx + a(1 - x). \tag{11.6}$$

Combining (11.6) with (8.4), it is further found that (11.5) is equivalent to

$$(b^2 - 1)\, x^2 - 2(2e^{2w/kT} - 1 - ab)\, x(1 - x) + (a^2 - 1)(1 - x)^2 = 0. \tag{11.7}$$

At any given temperature some mixtures will be internally unstable if (11.7),
regarded as a quadratic in $x$ or in $x/(1 - x)$, has real roots. The condition of critical
mixing obtains when (11.7) has two equal roots. Hence using the subscript $c$ to
denote critical values, it follows that

$$e^{2w/kT_c} = \tfrac12 \{1 + ab + \sqrt{[(a^2 - 1)(b^2 - 1)]}\}, \tag{11.8}$$

$$\frac{x_c}{1 - x_c} = \frac{\sqrt{(a^2 - 1)}}{\sqrt{(b^2 - 1)}}. \tag{11.9}$$

By using the definitions (8.3) of $x$, (11.3) of $a$ and (11.4) of $b$, (11.9) can be rewritten as

$$\left( \frac{N_A}{N_B} \right)_c = \frac{r_B q_B}{r_A q_A} \sqrt{\frac{q_A z - 1}{q_B z - 1}}. \tag{11.10}$$

The simplest example of these formulae, corresponding to $r_A = r_B = q_A = q_B = 1$,
$a = b = z/(z - 2)$, was obtained by Rushbrooke (1938). The next simplest example,
corresponding to $r_A = 2$, $q_A = 2(z - 1)/z$, $r_B = q_B = 1$, $a = (z - 1)/(z - 2)$, $b = z/(z - 2)$,
was obtained by Chang (1939).
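The critical conditions of this section are simple enough to evaluate directly. The sketch below is illustrative only: the function names are hypothetical, and the relation $zq = zr - 2r + 2$ used to obtain $q$ from $r$ is the one implied by the two equivalent forms of $a$ and $b$ (it is assumed to hold for both species).

```python
import math


def q_of(r, z):
    """Contact number q from z*q = z*r - 2*r + 2 (assumed for both species)."""
    return r - 2.0 * (r - 1.0) / z


def critical_point(r_A, r_B, z):
    """Return (a, b, w/kT_c, x_c) from the critical-mixing formulae.

    x_c is the critical value of the fraction x = q_A N_A / (q_A N_A + q_B N_B),
    not of the mole fraction.
    """
    q_A, q_B = q_of(r_A, z), q_of(r_B, z)
    a = z * q_A / (z * q_A - 2.0)
    b = z * q_B / (z * q_B - 2.0)
    exp_2w = 0.5 * (1.0 + a * b + math.sqrt((a * a - 1.0) * (b * b - 1.0)))
    w_over_kTc = 0.5 * math.log(exp_2w)
    ratio = math.sqrt(a * a - 1.0) / math.sqrt(b * b - 1.0)  # x_c / (1 - x_c)
    x_c = ratio / (1.0 + ratio)
    return a, b, w_over_kTc, x_c
```

For `critical_point(1, 1, 12)` this reproduces Rushbrooke's case $a = b = z/(z-2)$ with $x_c = \tfrac12$, and for `critical_point(2, 1, 12)` it gives Chang's values $a = (z-1)/(z-2)$, $b = z/(z-2)$.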

12. Quasi-chemical equilibrium and Bethe’s method. The treatment generally
known as Bethe’s method consists in the construction of an approximate grand
partition function for one (or sometimes two) central sites and their neighbours.
The equivalence between Bethe’s method and the quasi-chemical equations has
been proved for molecules each occupying a single site (Rushbrooke 1938; Guggenheim
1938; Fowler & Guggenheim 1940). I shall now show that for the most general
kind of mixture of molecules each occupying any number of sites Bethe’s method
always leads to the quasi-chemical conditions of equilibrium. More precisely I shall
show that these quasi-chemical conditions are implicitly assumed in Bethe’s method.

I shall give a detailed proof only for occupation of sites by either A or B molecules. 
The proof for the general case is precisely similar, but requires a more elaborate 
notation. 

In Bethe’s method an approximate grand partition function is constructed for
one (or two) central sites and their neighbours. There is one term in this grand
partition function for each distinct manner of occupation of all the sites. Let $(A, A, S)$
denote the term in the grand partition function corresponding to a selected central
site being occupied by an element of an A molecule, a selected nearest neighbour
being occupied by an element of a different A molecule, and all the remaining sites,
central and neighbours, being occupied in some specified manner denoted symbolically
by $S$. Let $(B, B, S)$ be similarly defined. Let $(A, B, S)$ denote the term corresponding
to the selected central site being occupied by an element of an A molecule,
the selected neighbour site by an element of a B molecule, all the remaining sites,
central and neighbours, being occupied in the manner $S$. Let $(B, A, S)$ be defined
similarly and correspond to the converse manner of occupation of the selected
central and neighbour site.

One of the essential approximations of Bethe’s method is the assumption

$$\left. \begin{aligned} (A, A, S)/(A, B, S) &\ \text{independent of } S, \\ (B, A, S)/(B, B, S) &\ \text{independent of } S. \end{aligned} \right\} \tag{12.1}$$

According to this assumption the two Bethe parameters $\epsilon$ and $\eta$ can be defined by

$$(A, A, S)/(A, B, S) = \epsilon/\eta, \tag{12.2}$$

$$(B, A, S)/(B, B, S) = \epsilon\eta. \tag{12.3}$$

It is usual to obtain further information concerning the relation between $\epsilon$ and $\eta$
by using the fact that the central site and its neighbour are physically equivalent.
The following alternative procedure is simpler and at least as profitable.

Sum (12.2) and (12.3) over all $S$, obtaining

$$(A, A)/(A, B) = \epsilon/\eta, \tag{12.4}$$

$$(B, A)/(B, B) = \epsilon\eta, \tag{12.5}$$

where $(A, A)$ denotes the sum of all terms in the grand partition function corresponding
to the selected central site being occupied by an element of an A molecule
and the selected neighbour site by an element of a different A molecule, and $(A, B)$,
$(B, A)$ and $(B, B)$ are similarly defined. Now divide (12.5) by (12.4), thus eliminating
$\epsilon$ and obtaining

$$\frac{(A, B)(B, A)}{(A, A)(B, B)} = \eta^2. \tag{12.6}$$

Since the central site and its neighbour are physically equivalent, it is obvious that

$$(A, B) = (B, A), \tag{12.7}$$

and consequently (12.6) is equivalent to

$$\frac{\{(A, B) + (B, A)\}^2}{4 (A, A)(B, B)} = \eta^2. \tag{12.8}$$


Now each term in the grand partition function is proportional to the frequency of
occurrence of the manner of occupation to which that term refers. Moreover, there
is no special significance attaching to the particular pair of sites selected as central
site and its neighbour. The numbers of pairs of sites occupied both by elements of
different A molecules, both by elements of different B molecules and one by an
element of an A molecule, the other by an element of a B molecule are therefore
proportional respectively to $(A, A)$, $(B, B)$ and $\{(A, B) + (B, A)\}$. Hence (12.8) is
equivalent to the condition (3.2) of quasi-chemical equilibrium, provided

$$\eta = e^{-w_{AB}/kT}, \tag{12.9}$$

which is precisely the value assumed for $\eta$ in Bethe’s method.

To sum up, the two essential approximate assumptions of Bethe’s method are
(12.1) and (12.9). The two assumptions together lead immediately to the
quasi-chemical conditions of equilibrium.

13. Nature of approximation. Neither the quasi-chemical method nor Bethe’s
method is completely accurate for the following reason. In both methods it is
assumed that having specified the manner of occupation of one site, say $a$, then the
probabilities of occupation of one of its neighbours, say $b$, in alternative manners are
independent of those for another neighbour site, say $b'$. Actually the manner of
occupation of $b$ affects the probabilities of its neighbours, say $c, c', \ldots$, and these
affect their neighbours and so on. But $b$ is related to $b'$ not only through $a$, but also
through closed rings $bcd \ldots b'$. Hence the probabilities of occupation of $b$ and $b'$ are
not truly independent.

The error due to this inexact assumption will be the more serious the shorter the 
rings connecting b to b' without passing through a. The error may be expected to be 
worst in a close-packed structure, in which b and b' can be neighbours of each other. 
An investigation of this worst case in the following section indicates that the 
inaccuracy is never serious. 

14. Higher approximation for close-packed lattice. An estimate of the degree of
inaccuracy of our formulae in the case of a close-packed lattice can be obtained by
applying equations of the quasi-chemical type to sets of three sites forming triangles
and comparing the result with that already obtained by its application to pairs of
sites. I shall consider only the simplest case of a binary mixture of molecules A
and B each occupying a single site.

As previously, let $z$ denote the number of neighbours of a single site. Let $z'$ denote
the number of triangles of three touching sites with any given pair of sites in common.
Then the total number of triangles of three touching sites is $(N_A + N_B)\, zz'/6$. The
numbers of triangles occupied in alternative manners now require for specification
two independent quantities instead of $X$ in the case of pairs of sites. Two such
quantities $Y$ and $Y'$ can be defined such that the numbers of triangles occupied in
each manner are as given in the following table:

AAA    $(N_A - 2Y - Y')\, zz'/6$

AAB    $3Y\, zz'/6$

ABB    $3Y'\, zz'/6$

BBB    $(N_B - Y - 2Y')\, zz'/6$.

Furthermore, $Y$ and $Y'$ are related to $X$, defined in § 2, by

$$X = Y + Y'. \tag{14.1}$$

The equations of quasi-chemical equilibrium for the triangles are

$$Y'(N_A - 2Y - Y') = Y^2 e^{2w/kT}, \tag{14.2}$$

$$Y(N_B - Y - 2Y') = Y'^2 e^{2w/kT}. \tag{14.3}$$

These equations are generally intractable, but they simplify considerably for a
mixture of equal numbers of A and B molecules. Then the solution obtained is

$$Y = Y' = N_A/(3 + e^{2w/kT}). \tag{14.4}$$


Denoting, as previously, by an asterisk the value of $X$ for $w$ zero, one obtains from
(14.1) and (14.4)

$$\frac{X}{X^*} = \frac{4}{3 + e^{2w/kT}} \quad \text{(2nd approx.)}, \tag{14.5}$$

in place of the previous approximation of § 8

$$\frac{X}{X^*} = \frac{2}{1 + e^{w/kT}} \quad \text{(1st approx.)}. \tag{14.6}$$

The discrepancy between (14.5) and (14.6) will increase with $w/kT$ and will be greatest
(for $N_A = N_B$) at the critical temperature, which according to the first approximation
is given by

$$e^{w/kT_c} = \frac{z}{z - 2} = \frac{12}{10} \tag{14.7}$$

for a close-packed three-dimensional lattice.

Substituting from (14.7) into (14.6) and (14.5), one obtains

$$\frac{X}{X^*} = \frac{2}{2.20} = \frac{1}{1.10} \quad \text{(1st approx.)}, \tag{14.8}$$

$$\frac{X}{X^*} = \frac{4}{4.44} = \frac{1}{1.11} \quad \text{(2nd approx.)}. \tag{14.9}$$

It is thus seen that in this particularly unfavourable case the two approximations
differ by less than 1 % in the value obtained for $X/X^*$.
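The arithmetic of this comparison is easily reproduced. The following sketch is illustrative only (the function names are not from the paper); it evaluates the pair and triangle approximations for $X/X^*$ at the first-approximation critical temperature of a close-packed lattice, $z = 12$, for equal numbers of A and B molecules.

```python
import math


def ratio_first_approx(w_over_kT):
    """X/X* from the pair (first) approximation, equal numbers of A and B."""
    return 2.0 / (1.0 + math.exp(w_over_kT))


def ratio_second_approx(w_over_kT):
    """X/X* from the triangle (second) approximation, equal numbers of A and B."""
    return 4.0 / (3.0 + math.exp(2.0 * w_over_kT))


z = 12  # close-packed lattice
w_over_kTc = math.log(z / (z - 2))  # critical value from the first approximation

first = ratio_first_approx(w_over_kTc)    # 2/2.20
second = ratio_second_approx(w_over_kTc)  # 4/4.44
relative_difference = abs(first - second) / first
```

The relative difference comes out a little under one per cent, in line with the statement in the text.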


15. Zeroth approximation. It has already been mentioned that the quasi-chemical
equations for the $X_{ij}$’s can be solved explicitly only for binary mixtures. It is
therefore expedient to derive approximate formulae for mixtures of more than two
components. It is actually easy to obtain formulae in which the inaccuracy is of the
order $(w_{ij}/kT)^2$. From the form of the quasi-chemical equations for the $X_{ij}$’s
and from the definition of the $X^*_{ij}$’s, it is clear that all the $X_{ij} - X^*_{ij}$ vanish when
all the $w_{ij}$’s vanish. A formula can therefore be obtained for the free energy
$F$ of the required degree of accuracy by expressing $F(N_i, X_{ij})$ as a power series in
$(X_{ij} - X^*_{ij})/X^*_{ij}$ and neglecting terms of order higher than the first. The Taylor
series for $F(N_i, X_{ij})$ is

$$F(N_i, X_{ij}) = F(N_i, X^*_{ij}) + \sum_{ij} \frac{\partial F(N_i, X_{ij})}{\partial X_{ij}} (X_{ij} - X^*_{ij}) + \text{terms in } \sum_{ij} (X_{ij} - X^*_{ij})^2. \tag{15.1}$$

Since $-F(N_i, X_{ij})$ is maximized with respect to $X_{ij}$, it is known that $\partial F(N_i, X_{ij})/\partial X_{ij}$
is small of the first order in $(X_{ij} - X^*_{ij})/X^*_{ij}$ and the second term on the right of
(15.1) is small of the second order in $(X_{ij} - X^*_{ij})/X^*_{ij}$. Hence to the required degree
of accuracy

$$F(N_i, X_{ij}) \simeq F(N_i, X^*_{ij}). \tag{15.2}$$

Substituting from (6.1) and using (5.1) and (4.1), one obtains

$$\begin{aligned} F &= -kT \sum_i N_i \log f_i - kT \log g(N_i, X^*_{ij}) + z \sum_{ij}{}' X^*_{ij} w_{ij} \\ &= -kT \sum_i N_i \log f_i - kT \log g(N_i) + z \sum_{ij}{}' X^*_{ij} w_{ij} \\ &= -kT \sum_i N_i \log f_i \rho_i + kT \sum_i \log N_i! - \tfrac12 z kT \log (\textstyle\sum_i q_i N_i)! \\ &\quad + (\tfrac12 z - 1)\, kT \log (\textstyle\sum_i r_i N_i)! + \frac{z \sum_{ij}' q_i N_i q_j N_j w_{ij}}{\sum_k q_k N_k}. \end{aligned} \tag{15.3}$$

The corresponding formula for the energy of mixing $E$ is

$$E = \frac{z \sum_{ij}' q_i N_i q_j N_j w_{ij}}{\sum_k q_k N_k}. \tag{15.4}$$




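By differentiating (15.3) with respect to $N_i$ one obtains for the partial potentials $\mu_i$
and the absolute activities $\lambda_i$

$$\mu_i = kT \log \lambda_i = kT \log \frac{N_i}{f_i \rho_i \sum_j r_j N_j} + \tfrac12 z q_i kT \log \frac{\sum_j r_j N_j}{\sum_j q_j N_j} + \frac{z q_i \sum_j' q_j N_j w_{ij}}{\sum_k q_k N_k} - \frac{z q_i \sum_{jk}' q_j N_j q_k N_k w_{jk}}{(\sum_k q_k N_k)^2}. \tag{15.5}$$

In the case of a binary mixture of A and B molecules (15.5) reduces to

$$\frac{\lambda_A}{\lambda_{A,0}} = \frac{r_A N_A}{r_A N_A + r_B N_B} \left\{ \frac{N_A + r_B N_B / r_A}{N_A + q_B N_B / q_A} \right\}^{\frac12 z q_A} \exp \left\{ \frac{z q_A w}{kT} \left( \frac{q_B N_B}{q_A N_A + q_B N_B} \right)^2 \right\}, \tag{15.6}$$

correct to the first order in $w/kT$.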
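The claim that the zeroth approximation is correct to the first order in $w/kT$ can be illustrated numerically by comparing the energy-dependent factor of the first approximation, $\{(\beta - 1 + 2x)/x(\beta + 1)\}^{\frac12 z q_A}$, with the exponential factor of the binary zeroth-approximation formula. The sketch below is not from the paper; the function names are hypothetical and only the logarithms of the two factors are compared.

```python
import math


def log_energy_factor_first(x, w_over_kT, z, q_A):
    """log of {(beta - 1 + 2x) / (x (beta + 1))}^(z q_A / 2), first approximation."""
    beta = math.sqrt((1 - 2 * x) ** 2 + 4 * x * (1 - x) * math.exp(2 * w_over_kT))
    return 0.5 * z * q_A * math.log((beta - 1 + 2 * x) / ((beta + 1) * x))


def log_energy_factor_zeroth(x, w_over_kT, z, q_A):
    """log of exp{z q_A (w/kT) (1 - x)^2}, zeroth approximation."""
    return z * q_A * w_over_kT * (1 - x) ** 2


def discrepancy(x, w_over_kT, z=12, q_A=1.0):
    """Difference between the two logarithms; expected to scale as (w/kT)^2."""
    return (log_energy_factor_first(x, w_over_kT, z, q_A)
            - log_energy_factor_zeroth(x, w_over_kT, z, q_A))
```

Doubling a small value of $w/kT$ should roughly quadruple `discrepancy`, which is the numerical signature of an error of second order.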

As the treatment in §§ 1-13 is equivalent to Bethe’s first approximation, I have
referred to it as the first approximation, and I have referred to the more accurate
treatment in § 14 as the second approximation. The formula of the present section
must therefore for the sake of consistency be called the zeroth approximation. This
is in accordance with the terminology used for regular solutions by Fowler &
Guggenheim (1939, § 819). The zeroth approximation corresponds to Hildebrand’s
approximation for regular solutions and to Bragg and Williams’s approximation
in the treatment of order-disorder in alloys (Fowler & Guggenheim 1939, §§ 819, 1318).

16. Formulae for z infinite. The maximum value of $z$, corresponding to closest
packing, is 12, and greater values of $z$ have no physical meaning. It is, nevertheless,
of interest to consider how the formulae for a binary mixture behave when $z \to \infty$
and $w \to 0$, keeping $zw = u$ finite, because the free energy takes a strikingly simple
form, previously proposed by Flory (1942) in the special case $r_B = 1$.

Some care is needed in proceeding to the limit, because, for example, although
$r - q$ tends to zero, $\tfrac12 z(r - q)$ remains equal to $r - 1$. It is convenient to define ‘volume
fractions’ $x_A$, $x_B$ by

$$x_A = \frac{r_A N_A}{r_A N_A + r_B N_B}, \qquad x_B = \frac{r_B N_B}{r_A N_A + r_B N_B}. \tag{16.1}$$

After proceeding to the limit $z \to \infty$, it is seen that $x \to x_A$ and $1 - x \to x_B$, but care
must be taken not to replace $x$ by $x_A$ indiscriminately in terms containing $z$.

The formulae eventually obtained are

$$\log \frac{p_A}{p_{A,0}} = \log \frac{\lambda_A}{\lambda_{A,0}} = \log x_A + \frac{(r_B - r_A)\, x_B}{r_B} + \frac{r_A x_B^2 u}{kT}, \tag{16.2}$$

$$F - N_A kT \log \lambda_{A,0} - N_B kT \log \lambda_{B,0} = N_A kT \log x_A + N_B kT \log x_B + (r_A N_A + r_B N_B)\, x_A x_B u, \tag{16.3}$$

$$E = (r_A N_A + r_B N_B)\, x_A x_B u. \tag{16.4}$$

Differentiating (16.2) with respect to $N_B$, one obtains

$$\frac{\partial \log p_A}{\partial N_B} = -\frac{r_A^2 N_A + r_B^2 N_B}{(r_A N_A + r_B N_B)^2} + \frac{r_A N_A\, r_B N_B}{(r_A N_A + r_B N_B)^3} \frac{2 r_A r_B u}{kT}. \tag{16.5}$$


References

Chang 1939 Proc. Camb. Phil. Soc. 35, 265.
Flory 1942 J. Chem. Phys. 10, 51.
Fowler & Guggenheim 1939 Statistical thermodynamics. Cambridge University Press.
Fowler & Guggenheim 1940 Proc. Roy. Soc. A, 174, 189.
Guggenheim 1935 Proc. Roy. Soc. A, 148, 304.
Guggenheim 1938 Proc. Roy. Soc. A, 169, 134.
Guggenheim 1944a Nature, Lond., 153, 255.
Guggenheim 1944b Proc. Roy. Soc. A, 183, 203.
Orr 1944 Trans. Faraday Soc. 40, 320.
Rushbrooke 1938 Proc. Roy. Soc. A, 166, 296.



The initiation of long electrical discharges 

By C. E. R. Bruce, M.A., B.Sc., A.M.I.E.E.

(Communicated by Sir Edward Appleton, F.R.S. Received 15 October 1943.
Revised 11 April 1944)

The glow-arc transition theory of the first leader stroke of a lightning flash is further
considered, and shown to be in accord with all the observations available, whereas the
recombination theory is not considered to be. The greatly reduced potentials already indicated to
be adequate to cause flashes, of the order of only 1 % of those required in a homogeneous
field, and the corona currents from the leader stroke channel, are confirmed by a discussion of
the records of the currents in, and electrostatic field changes caused by, the first leader, which
further indicates that formulae derived from laboratory studies of corona currents from
wires and from pointed conductors can be extrapolated to lightning voltages, i.e. over a
range of about 1000 to 1.

1. Introduction

It was pointed out by Larmor (1914) that when electrical breakdown of a gas has
once been started in a non-uniform electric field, the resulting discharge will extend
beyond the region throughout which the critical electric field strength required for
breakdown was initially exceeded. This effect was discussed by Wilson (1920) and
Simpson (1929), the former of whom considered that the potential difference required
to cause a vertical lightning flash between a thundercloud and earth might be
thereby reduced by a factor of not more than six below that which would be required
if the field had been homogeneous. Though the value obtained in this way for the
potential of a thundercloud is that usually quoted (see e.g. Schonland 1939; Loeb
1939a; Allibone 1941), there are grounds for believing that some workers in the
subject doubted whether the average gradients between cloud and earth to which
such a difference of potential would give rise do in fact exist. Some degree of precision
was recently given to these latter ideas (Bruce & Golde 1941).

The first indication that the pre-discharge fields are comparatively small was
obtained when the charge distributed along the leader stroke channel was considered
in relation to the latter’s distributed capacitance. This charge distribution was
derived from an interpretation of oscillographic records of the current in a lightning
flash, in the light of observations of the velocity of the return stroke, the latter
having been obtained by Schonland and his collaborators (1934, 1935, 1938) from
their photographs of lightning flashes taken with a Boys camera. The duration of
the initial high-current portion of the lightning current wave is about 100 μsec.,
agreeing with the time required for the return stroke to reach the cloud charge, the
charge distributed along the leader stroke channel having been neutralized during its
passage. This latter charge is found by integration of the oscillograms to be about
1 coulomb, while the capacitance of such a vertical conductor just not in contact
with the earth and 2.5 km. long, from a formula given by Goodlet (1937), is about
$1.28 \times 10^{4}$ μμF. Its potential is therefore $7.8 \times 10^{7}$ V, giving an average field between
cloud and earth, along the actual path taken by the lightning flash, of 312 V/cm.,
while the shape of the current wave and the distribution of capacitance along the
conductor together indicate that the field is fairly uniformly distributed.
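The arithmetic behind these figures can be retraced in a few lines. The sketch below simply applies $V = Q/C$ and divides by the channel length, using the values quoted in the text; the variable names are illustrative, and the micro-microfarad is taken as $10^{-12}$ F.

```python
# Values quoted in the text
charge_coulomb = 1.0                 # charge along the leader channel
capacitance_farad = 1.28e4 * 1e-12   # 1.28 x 10^4 micro-microfarads
channel_length_cm = 2.5e5            # 2.5 km expressed in centimetres

# Potential of the channel and the average field along it
potential_volt = charge_coulomb / capacitance_farad
average_field_v_per_cm = potential_volt / channel_length_cm
```

This gives a potential close to $7.8 \times 10^{7}$ V and an average field of about 312 V/cm., as stated.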

Again, the results obtained by Schonland and his collaborators indicate that there 
are on an average about three successive strokes in a flash, while the average time 
interval between these strokes is about sec. Since the velocity of propagation 
of the leader stroke process is about $3 \times 10^{7}$ cm./sec., this indicates that the
thundercloud charge is spread over an area of the order of 100 sq. km. This latter charge,
the total charge brought to earth in a flash, is on an average about 30 coulombs, so
that the average field is thus about 200 V/cm., and the potential of the
thundercloud charge about $4.5 \times 10^{7}$ V.

A similar conclusion as to the dispersion of the charge in the thundercloud is arrived
at from a consideration of the duration of thunder. The data given by Wilson (1920)
and Mathias (1924) show that in the case of near flashes this lasts for over 30 sec.,
and in some cases for over 40 sec., indicating that the diameter of the charged area
is of the order of more than 10 km., in agreement with the above estimates.

The average pre-discharge field between cloud and earth is thus apparently only
of the order of one or a few hundred volts per centimetre, or only about 1 % of
that required to cause breakdown in a uniform field. This estimate is, however, of
the same order as that of the field changes and pre-discharge fields actually recorded
at the earth (Wilson 1920; Wormell 1939; Simpson & Scrase 1937; Appleton &
Chapman 1937), and as that of the fields recorded during the ascent of test balloons
through thunderclouds, when lightning flashes occurred nearby (Simpson & Scrase
1937; Simpson & Robinson 1941). Through the courtesy of Sir George Simpson, the
author was able to examine the records obtained in the last-mentioned investigations,
to see if there was any evidence for a gradual increase of the electric field with
increase of height above the ground, but none was found.

It has generally been considered that the individual strokes of a flash represent the
successive discharge to earth of separate volumes of charge in the cloud (Schonland
1938). This complication of the simpler distribution suggested above would, however,
appear to be unnecessary, and is rendered improbable by the occasional occurrence
of flashes comprising large numbers of separate strokes, as many as 27
(Schonland, Malan & Collens 1935), and even 40 (Larsen 1905) and 42 (Matthias 1929)
having been observed in one flash. A likelier explanation would appear to be that
each stroke results from the meeting of a leader progressing from the initial discharge
channel into the cloud, with one which has originated at a local elongation or
protuberance from the main volume of charge, similar to that responsible for the
initiation of the first stroke to earth. The charge deposited along this second leader
will then be rapidly distributed along the channel to earth, giving the phenomenon
of the continuous dart leader of subsequent strokes (Schonland et al. 1935), and
thereafter neutralized during the ensuing return stroke. In the absence of such local
field concentrations, the channel or channels from the point of origin of the flash
will simply progress through the cloud neutralizing the charge therein by a gradual
process (corona discharge), and it may be noted that the largest charge hitherto
recorded oscillographically (McEachron 1939) was practically entirely conducted
to earth in just such a relatively continuous manner.

So far as the field changes caused by successive strokes are concerned, the above
interpretation will give rise to the same phenomena as would be observed were the
total charge in fact initially segregated into separate pockets. Until further
evidence is obtained, therefore, it would appear to be sufficient to consider the
comparatively simple distribution of charge suggested above. In what follows
a mechanism is suggested to account for the propagation of an electrical discharge
through a gas in which the field strength, apart from a very localized concentration
at the point of origin of the discharge, is of the order of that suggested above, and
in the process further precision is given to the values of the fields involved.

2. Importance of corona currents

An important aspect of these very long electrical discharges has recently been
pointed out (Bruce 1941), namely, that heavy corona currents must flow laterally
from the leader stroke channel during its growth towards the opposite electrode.
Since, apart from the relatively small potential gradient required for the maintenance
of arc conditions, the potential of the initiating electrode is being propagated
into the intervening space, the potential difference between the streamer
and the surrounding space must build up to many millions of volts, even with the
much smaller field strengths now obtained. Such potentials are in fact observed
when the streamer makes contact with an overhead transmission line (Pittman &
Torok 1931). It is shown herein (§ 6) that in the case of the lightning discharge these
lateral corona currents must build up to the order of hundreds or even thousands of
amperes, during the progression of the leader stroke. That such currents do, in fact,
flow has been confirmed by direct oscillographic records of the current in the upward
leader strokes from the Empire State Building (McEachron 1939), and also indirectly
by records of the electrostatic field changes caused by the leader stroke of nearby
flashes (§§ 4, 6).

It would appear that the existence of currents of this order rules out the
possibility of the ageing of the ionization in the leader stroke channel, envisaged by
Schonland (1938), and also the recombination theory of the stepped structure of the
first leader stroke as it has been developed by Meek (1939) and Loeb (1939b).

It is also evident that the only type of electric discharge capable of carrying the 
large currents which are thus seen to flow steadily in the leader stroke is the arc 
discharge, and the realization of this fact has led to a new theory of the stepped 
leader stroke. 


3. Mechanism of the stepped leader stroke

The streamers which arise, for example, from the top of the Empire State Building,
prior to the occurrence of a lightning flash, must start as brush or glow discharges
carrying minute currents, so that at some stage in their development when an upward
stroke occurs from the Building, the sudden transition to arc conditions must take
place. Again, if the fresh start of the pilot streamer (Schonland 1938) after the
occurrence of a dart is considered, the same sequence must ensue. The current
required to maintain the lateral corona must increase as the pilot streamer lengthens,
so that an increasing current must flow through the tip of the preceding dart, or
initiating point, until a total current of the order of an ampere is reached, at which
point the sudden transition from glow to arc conductivity is known to occur (Todd
& Browne 1930; Attwood, Dow & Krausnick 1931; Fan 1939). This condition is
represented diagrammatically in figure 1, in which $I_T$ is this critical transition
current. The reason for the complete breakdown of the whole additional length of the
streamer, $AB$, to arc conditions at this point in its history, is that a position of
instability has been reached, since the currents flowing along the streamer from $A$
to some point $A'$, near $B$, are already large enough to maintain arc conditions, with
a voltage gradient only about one-tenth of that required for the conduction of the
same current in the glow regime, if only arc conditions could be initiated therein.
This follows from the observation (Attwood et al. 1931) that, when once established,
arc conductivity is maintained down to currents of the order of only one-tenth of
those required to cause glow to arc transition, before transition back from arc to
glow occurs. The disturbance and redistribution of space charge caused by the
initiation of arc conditions at $A$ is sufficient to trigger the change-over to the new
conditions of stability throughout the newly formed channel, though the change will
not be permanent beyond $A'$. It is this ‘hysteresis’ effect, so to speak, which accounts
for the observed dart structure.

Figure 1. Diagram illustrating formation of leader stroke darts.
($I_T$ is the current required for glow to arc transition.)

Up to the instant of transition the characteristics of the newly formed streamer
will be at best those of the glow discharge, i.e. the voltage gradient required to
maintain it will be of the order of several hundred volts per centimetre, whereas at
the transition it will be suddenly reduced to the order of tens of volts per centimetre,
and will have a pronouncedly negative volt-ampere characteristic, so that
as the current through it rises with the increasing length of the channel, the gradient
required to maintain the discharge current will be still further reduced. Another
important feature of the new regime is that the streamer channel after the transition
has the characteristics of an arc plasma, and is from most points of view simply an
extension of the original conductor. It is this important characteristic, together
with the greatly reduced potential gradient required for the maintenance of arc
conductivity, which explains the ‘self-propagating’ nature of the discharge.


4 . El ectrostatic field changes caused by the leader stroke 

There are several ways in which the new theories of the corona currents and the 
mechanism of the leader progression can be checked quantitatively against the 
observed results. Perhaps the most striking is the comparison of the calculated and 
observed electrostatic field changes caused by the leader stroke. 

First, as regards the potential required to initiate such a discharge: The largest 
pre-discharge'field which has hitherto been observed is about 100 V/cm. (Wormell 
1939 ), i.e. well below the estimates given in § 1 above. The Empire State Building 
short-circuits, as it were, 1250 ft. of this field, so that if fields of this order existed 
around the Building, then the potential difference between its top and the sur¬ 
rounding space would be about 4,000,000 V. However, it will be shown in § 6 that 
flashes are apparently initiated by the Building in fields which would not normally 
result in lightning flashes from the cloud, the pre-discharge field being in the case of 
these upward flashes apparently only about 50 V/cm. With such fields the potential 
difference between the top of the Building and the surrounding space is only about 
2 , 000,000 V, so that this would appear to be the potential difference required to 
initiate a positive discharge in such a case, for so far only flashes in which the 
Building was the positive electrode have been observed. It will be shown in § 7 that 
a similar value for this critical potential difference is indicated by extrapolation 
from laboratory data on the magnitude of currents from pointed conductors, since 
the latter suggest that currents of the order of that required for the occurrence of 
glow to arc transition will flow from the point at these potentials. Though extrapolation
over such a wide range of potential may appear at first sight uncertain, to
say the least of it, nevertheless the agreement of theory and observation is rendered 
less surprising by what follows on the magnitudes of the lateral corona currents, 
which are likewise found to agree with the values obtained by extrapolation from 
laboratory data. 
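The arithmetic behind the two potential differences quoted above may be sketched as follows. This is a minimal illustration only, assuming that the Building short-circuits a uniform field over its full height of 1250 ft.; the function name and the conversion constant are introduced here and are not part of the original analysis.

```python
FT_TO_CM = 30.48  # centimetres per foot


def potential_difference_volts(field_v_per_cm: float, height_ft: float) -> float:
    """Potential short-circuited by a conductor of given height in a uniform field."""
    return field_v_per_cm * height_ft * FT_TO_CM


# Fields of 100 V/cm. (largest observed) and 50 V/cm. (upward flashes, see text).
for field in (100.0, 50.0):
    pd = potential_difference_volts(field, 1250.0)
    print(f"{field:.0f} V/cm. over 1250 ft. gives about {pd:.3g} V")
```

The two results are roughly 3.8 and 1.9 million volts, which the text rounds to 4,000,000 and 2,000,000 V.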

Once the streamer is started, its potential relative to the surrounding space
builds up at the rate of $X_1$ V/cm. travelled, if the small voltage gradient required to
maintain the arc, $X_a$, is neglected, where $X_1$ is the value of the pre-discharge electric
field between cloud and ground. At a point distant $l$ from the start, it is therefore
given by

$$V = V_0 + l(X_1 - X_a) \approx V_0 + lX_1,$$

where $V_0$ is the initial potential difference between the point at which the streamer
originates, and the surrounding space.



The initiation of long electrical discharges 233 

The corona current per centimetre length of the discharge channel at this point
will be

$$I_c = k(V_0 + lX_1)^2, \qquad (1)$$

assuming that this current is proportional to the square of the conductor voltage
(Whitehead 1927), the potential which marks the onset of corona being in this case
relatively negligible. The rate of change of electric moment at time $t$ is therefore

$$\frac{d\mu}{dt} = \int_0^{vt} k(V_0 + lX_1)^2\, 2l\, dl, \qquad (2)$$

where $v$ is the velocity of propagation of the leader stroke, and the total electric
moment destroyed in time $t$ is

$$\mu = \int_0^{t}\!\int_0^{vt} 2kl(V_0 + X_1 l)^2\, dl\, dt. \qquad (3)$$

The electrostatic field change at distance $r$ from the flash due to this change of
electric moment is given by

$$e_s = \frac{\mu}{r^3}, \qquad (4)$$

in which $\mu$ and $e_s$ are in electrostatic units, so that

$$e_s = \frac{9 \times 10^{13}}{r^3} \int_0^{t}\!\int_0^{vt} 2kl(V_0 + X_1 l)^2\, dl\, dt \ \text{V/m.}, \qquad (5)$$

$l$ and $r$ being in cm., $v$ in cm./sec., $V_0$ and $X_1$ in V and V/cm. respectively, and $e_s$ in V/m.
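Equation (5) can be evaluated in closed form, since the integrand is a polynomial in $l$. The following is a sketch under stated assumptions rather than the author's own working: the factor $9 \times 10^{13}$ is taken to be the conversion from amperes, volts and centimetres to V/m. (about $3 \times 10^9$ e.s.u. of charge per coulomb and $3 \times 10^4$ V/m. per e.s.u. of field), $k$ is taken in amp. per cm. per volt squared, and the function names and numerical inputs are illustrative only.

```python
def leader_field_change_v_per_m(t, r, v, k, v0, x1):
    """Closed form of the double integral in equation (5); result in V/m.

    t in sec., r in cm., v in cm./sec., k in amp./(cm. V^2), v0 in V, x1 in V/cm.
    """
    moment = 2.0 * k * (
        v0**2 * v**2 * t**3 / 6.0
        + v0 * x1 * v**3 * t**4 / 6.0
        + x1**2 * v**4 * t**5 / 20.0
    )
    return 9.0e13 * moment / r**3


def leader_field_change_numeric(t, r, v, k, v0, x1, n=400):
    """Midpoint-rule evaluation of the same double integral, as a cross-check."""
    dt = t / n
    total = 0.0
    for i in range(n):
        length = v * (i + 0.5) * dt  # channel length at this instant
        dl = length / n
        inner = 0.0
        for j in range(n):
            l = (j + 0.5) * dl
            inner += 2.0 * k * l * (v0 + x1 * l) ** 2 * dl
        total += inner * dt
    return 9.0e13 * total / r**3


# Illustrative inputs only: a flash at 10 km. with assumed k, V_0, X_1 and v.
args = (0.010, 1.0e6, 2.3e7, 4.0e-17, 2.0e6, 150.0)
print(leader_field_change_v_per_m(*args), leader_field_change_numeric(*args))
```

Because the leading term varies as $t^5$, most of the field change accrues towards the end of the leader stroke, which is the feature compared with the oscillograms in what follows.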

No allowance has been made in the above for branching of the leader stroke, or 
for the fact that the general field through which the leader is advancing decreases 
somewhat as charge is lowered from the cloud by the leader stroke. As regards the 
former, more than half the total number of flashes appear to be unbranched (Schonland
& Collens 1934; Schonland et al. 1935), and for these the present analysis holds.
The average number of branches in branched flashes is about 3 (Schonland et al.
1935), but not all these are in process of formation at the same time. It is therefore
unlikely that allowance for branching would materially alter the results deduced
herein. The effect would be to reduce still further the values obtained for the electric
field.

The allowance to be made for the change of electric field strength during the 
leader stroke will also be small. This field change has been measured by Appleton & 
Chapman (1937) and by Schonland, Hodges & Collens (1938) who found it to be 33 
and 25 %, respectively, of the total field change due to the stroke. As there are on an 
average three strokes in a flash, the field change caused by the first leader stroke is 
thus only of the order of 10 or 20 % of the total change caused by the flash. Again, 
the increase of the estimated initial potential required to compensate for the decrease 
of the field during the progression of the leader stroke will be further reduced by the 
fact that the effects considered increase as the square of the potential difference 
between the channel and the surrounding space. 





C. E. R. Bruce 


Thus allowance for these two factors would certainly not affect the order of the 
values deduced herein for the potential of the thundercloud charge, the effect of 
each being small and its direction opposite to that of the other. 

Oscillographic records of $e_s$ for near strokes have been obtained by Appleton &
Chapman (1937), so that the above is an equation for $k$ when $V_0$, $X_1$ and the height of
the thundercloud charge are known. This last varies somewhat, but may be taken
to be nearly 2.5 km. (Simpson & Scrase 1937). The value of $e_s$ at the end of the
leader stroke, i.e. immediately prior to the sudden change due to the return stroke,
was measured in the two oscillograms given by Appleton & Chapman (their figure 6,
(i) and (ii)), and the values of $k$ calculated in each case, corresponding to values of
$X_1$ of the order of a few hundred volts per centimetre. The resulting curves are shown
in figure 2, from which $k$ will be seen to be of the order of $10^{-16}$ or $10^{-17}$.



Figure 2. Variation of corona constant $k$ with potential of thundercloud charge. Curves
based on figure 6 (i) and (ii) of Appleton & Chapman (1937). Full line, figure 6 (i); broken
line, figure 6 (ii). Abscissa: potential of thundercloud charge (volts).

That these two records represent fairly normal conditions is seen from a comparison
of the fields and durations involved with the average conditions found by
these and other investigators. The flashes in question both occurred at about 10 km.
distance, and the total field change caused by the stroke was about 1000 V/cm. in
each case. The average value obtained by Appleton & Chapman at this distance was
about 650 V/cm., while the average field change caused by a complete flash at the
same distance was found by Wormell (1939) to be about 1100 V/cm., thus indicating
that if anything the field changes in the two oscillograms in question are on the high
side, but probably do not exceed the average by a factor of more than about 2.






Again, as to the durations of the field changes, these indicate for the velocity of the
leader stroke values of about $2.3 \times 10^7$ and less than $7 \times 10^7$ cm./sec. respectively in
the two cases, as compared with Schonland’s mean value of about $3.8 \times 10^7$ cm./sec.,
and McEachron’s mean value of about $2.5 \times 10^7$ cm./sec.

Several investigations of the corona currents from wires have been made and from
them the value of $k$ can be determined. Thus Watson’s data on d.c. corona (1910)
yield a value of $k = 4.2 \times 10^{-17}$, while Peek’s data (1915) give values of $k$ somewhat
higher than this.

This agreement between the calculated and measured values of $k$ would appear
to confirm both the existence of these corona currents and the greatly reduced
values of the thundercloud potential now obtained.



Figure 3. Electrostatic field change due to leader stroke. Observed (Appleton & Chapman
1937, figure 6 (ii)) and calculated. Points measured from oscillogram. Broken line, original
calculation; full line, calculated with new time base.

When the variation of $e_s$ with time was studied, there appeared to be a divergence
between the calculated and observed values. (Compare the full and broken line curves
of figure 3.) However, on further consideration it appeared that since, on the
present view, a lightning flash results from a breakdown originating at a field concentration
in a field which is elsewhere fairly uniform, the records obtained of the
change of electrostatic field with time should be geometrically very similar, provided
that there is sufficient charge available to maintain the discharge. As it has just been
seen that the field changes caused by the two strokes in question are both above
the average, the latter condition would appear to be satisfied. Comparison of the
aforementioned records (Appleton & Chapman 1937, figure 6 (i) and (ii)) showed
that such a similarity did not exist between these two records, and that the initial
quiescent period in their figure 6 (ii), before the trace leaves the zero line, already of
long duration, should be longer still. This at once suggested that the oscillograph
had in this case only tripped some time after the lightning flash had actually started,
owing to the weakness of the initial pulses. A lower limit to this time lag was obtained
from a comparison of the two records by making the assumption that no lag had
occurred in figure 6 (i). The required increase in the time base of figure 6 (ii) was thus
found to be about 3.4 msec., the total duration of the leader stroke being then
10.8 msec. Using this new time base, complete agreement is obtained between the
calculated and observed field changes, as is seen in figure 3, in which the circles
represent the values measured from the oscillogram, while the full line curve is that
calculated from equation (5) above using the new time base. The theory thus offers
a satisfactory explanation of these records, which had remained unaccounted for
since they were first obtained by Appleton & Chapman.

5. Determination of the thundercloud potential 

It will be seen that on the present theory there is no well-defined potential required
for the occurrence of a lightning flash, since the higher the average field the less will
be the discontinuity necessary to initiate the ‘self-propagating’ discharge. Thus
upward flashes will be initiated by high buildings in fields which are approximately
inversely proportional to the heights of the buildings, and the same will apply to
elongations of space charge in the cloud. Indeed, there would appear to be no reason
why, in the extreme case, the average field need be much greater than that required
for the maintenance of an arc, i.e. of the order of 10 V/cm. in air at atmospheric
pressure, and it will be shown in § 6 that, in the case of flashes to the Empire State
Building, the average field between cloud and ground is probably only of the order
of 50 V/cm.

The results of the previous section afford a method of determining the potential
of the thundercloud charge in individual cases from records of the electrostatic field
change during the leader stroke, if at the same time the height of the charge is
determined by the method of sounding balloons adopted by Simpson. From each
of the former records, together with the known height of the charge, a curve corresponding
to those in figure 2 can be derived, giving $k$, the corona constant, in relation
to $X_1$. This $k$ must be a constant characteristic of these long-discharge channels,
the value of which will become apparent when more records are available for study.
When the value of $k$ has thus been determined, the curve for each flash will then yield
a value for the average gradient below the thundercloud charge for the flash in
question. As an illustration, had the heights of the charge in the two cases dealt
with in figure 2 been found each to have the average value of 2.5 km., then if we
provisionally adopt the value of $k$ found experimentally, viz. about $4 \times 10^{-17}$, the
cloud potentials would have been $6.3 \times 10^7$ and $3.0 \times 10^7$ V respectively. From the
combination of the results of the two investigations suggested above it will likewise be
possible to determine the velocity of the leader stroke, and hence the variation of
the latter with the initial field strength. The indications of the comparison of the
above two records are that velocity decreases with decrease in field strength, and in
this connexion it is of interest to note that the long durations observed by McEachron
(1939) in the case of flashes to the Empire State Building may be due to this cause,
the flashes having been initiated by the Building in fields which are smaller than those
required to cause normal flashes from cloud to earth.
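The step from a cloud potential to the average gradient below the charge, on which the illustration above depends, is a simple division. A minimal sketch follows, assuming a uniform field between the charge and the ground; the function name is introduced here for illustration.

```python
CM_PER_KM = 1.0e5


def average_gradient_v_per_cm(cloud_potential_v: float, charge_height_km: float) -> float:
    """Average gradient below the charge, taking the field as uniform over its height."""
    return cloud_potential_v / (charge_height_km * CM_PER_KM)


# The two cloud potentials quoted in the text, with the average height of 2.5 km.
for potential in (6.3e7, 3.0e7):
    gradient = average_gradient_v_per_cm(potential, 2.5)
    print(f"{potential:.1e} V at 2.5 km. -> {gradient:.0f} V/cm.")
```

The two potentials correspond to gradients of about 250 and 120 V/cm., in keeping with the "few hundred volts per centimetre" of § 4.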

A further interesting deduction from the present theory of the initiation and 
propagation of a lightning flash is that the protective range of a lightning conductor 
or tall building will not be constant for all lightning flashes, but will increase with the 
potential difference between the leader stroke channel as it approaches the earth and 
the surrounding space. For different leader strokes along any one vertical path, the 
induced potential of the space immediately surrounding the lightning conductor 
will reach the value necessary for the initiation of a continuing streamer when the 
leader stroke is at different heights, so that the earlier this occurs the more chance is 
there of this streamer diverting the downcoming leader towards the lightning 
conductor. 


6. Current in the leader stroke 

A further check of the present theory is derived from a consideration of the
magnitude of the current in the leader stroke, since this has been measured oscillographically
in strokes to the Empire State Building by McEachron (1939). From
equation (1) it follows that the current required to maintain the corona discharge
from the channel at time $t$ from the start is given by

$$i_t = \int_0^{vt} k(V_0 + X_1 l)^2\, dl \qquad (6)$$

$$\phantom{i_t} = kvt\left(V_0^2 + V_0 X_1 vt + \tfrac{1}{3} X_1^2 v^2 t^2\right), \qquad (7)$$

which, for the flashes considered, rises to values of the order of 5000 amp. These
values appear high, especially in view of the fact that previous estimates of the
steady currents during the leader stroke were of the order of 0.1 amp. to several
amperes (Meek 1939; Schonland 1938). However, currents of the order of those now
obtained are essential to account for the field changes observed by Appleton &
Chapman, whose results are consistent with those of Wilson (1920), Wormell (1939)
and Laby and others (1940) as regards the magnitudes of the field changes observed,
and with those of Schonland and others (1938) as regards the fraction of the total
field change which occurs during the leader stroke.
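The order of magnitude quoted for the leader current can be checked from equation (7). The sketch below is illustrative and rests on assumptions not stated in this form in the text: the channel length $vt$ is set equal to the full height of 2.5 km., $V_0$ is neglected, $k$ is taken as $4 \times 10^{-17}$, and the gradients of 120 and 252 V/cm. are those implied by the cloud potentials of § 5. The function names are hypothetical.

```python
def corona_supply_current_amp(length_cm, k, v0, x1):
    """Equation (7), with the channel length written in place of vt."""
    return k * length_cm * (v0**2 + v0 * x1 * length_cm + (x1 * length_cm) ** 2 / 3.0)


def corona_supply_current_numeric(length_cm, k, v0, x1, n=10000):
    """Midpoint-rule evaluation of equation (6), as a check on the closed form."""
    dl = length_cm / n
    return sum(k * (v0 + x1 * (j + 0.5) * dl) ** 2 * dl for j in range(n))


K = 4.0e-17          # assumed corona constant, amp./(cm. V^2)
LENGTH = 2.5e5       # assumed channel length, cm. (2.5 km.)
for gradient in (120.0, 252.0):
    current = corona_supply_current_amp(LENGTH, K, 0.0, gradient)
    print(f"X_1 = {gradient:.0f} V/cm. -> about {current:.0f} amp.")
```

Under these assumptions the current at the end of the leader stroke is roughly $3 \times 10^3$ amp. for the lower gradient and rather more than $10^4$ amp. for the higher, that is, of the order of thousands of amperes.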

The values obtained for the currents in the leader stroke are certainly considerably
higher than those observed oscillographically by McEachron (1939) in
upward strokes from the Empire State Building, which last rose only to about
500 amp. This is not surprising, however, since flashes must have occurred from the
Building at field strengths which would not ordinarily have caused flashes to occur
from the cloud, so that a lower value of the current is to be expected. That this in
fact happened is seen from the very high number of flashes, as many as 11 or 19,
which struck the Building in one day, numbers very much greater than the number
of flashes which would have occurred over an area of normal country equal to that
‘protected’, in an electrical sense, by the Building. It has been suggested (Meek
1941) that these oscillograms only indicate the peaks of the current pulses. It is to
be noted, however, that the envelopes of the troughs themselves reach about 300 and
500 amp. respectively during the leader stroke in the two of McEachron’s oscillograms
to which reference is made by Meek.

The currents observed by McEachron in fact enable the average field required to
cause a flash from the Building to be calculated, simply by regarding (7) as an
equation to determine $X_1$. The average gradient is thus found to be about 55 V/cm.,
using the same values of the corona constant and height of the cloud charge as in the
previous section, so that the total potential required is thus only of the order of
$1.5 \times 10^7$ V. This conclusion could be checked by recording the electrostatic fields
and field changes during thunderstorms in the vicinity of the Building, either on
one of the surrounding buildings, using the method of Appleton & Chapman, or in
the space below the thundercloud, by the use of sounding balloons.
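Regarding (7) as an equation for $X_1$ amounts to solving a quadratic in $X_1 vt$; a bisection does equally well and is sketched below. The inputs are assumptions made for illustration (a target current of 500 amp., a channel length of 2.5 km., $k = 4 \times 10^{-17}$ and $V_0 = 2 \times 10^6$ V); the author's exact working values are not all stated, so the result should be expected to agree with the quoted gradient only in order of magnitude.

```python
def leader_current_amp(length_cm, k, v0, x1):
    """Equation (7), with the channel length written in place of vt."""
    return k * length_cm * (v0**2 + v0 * x1 * length_cm + (x1 * length_cm) ** 2 / 3.0)


def solve_gradient_v_per_cm(target_amp, length_cm, k, v0, hi=1000.0, tol=1e-9):
    """Bisection for X_1; the current increases monotonically with X_1 >= 0."""
    lo = 0.0
    low_current = leader_current_amp(length_cm, k, v0, lo)
    high_current = leader_current_amp(length_cm, k, v0, hi)
    if not low_current <= target_amp <= high_current:
        raise ValueError("target current is outside the bracketed range")
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if leader_current_amp(length_cm, k, v0, mid) < target_amp:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


# Illustrative values only; see the assumptions listed above.
gradient = solve_gradient_v_per_cm(500.0, 2.5e5, 4.0e-17, 2.0e6)
print(f"gradient of the order of {gradient:.0f} V/cm.")
```

With these inputs the gradient comes out at some tens of volts per centimetre; a different assumed channel length or target current shifts it accordingly.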

This field, together with the height of the Building, yields for the potential 
difference between the top of the Building and the surrounding space necessary to 
initiate the discharge in this case a value of the order of 2,000,000 V. So far only
flashes in which the Building was the positive electrode have been obtained, so that 
the above result applies at present for this polarity only, and for a field concentration 
equal to that caused by the Building. 

It will be noted that no account has been taken of the fact that, whereas all the 
downward leaders so far investigated have been negative, all the upward leaders 
from the Empire State Building have been positive. Though positive corona currents 
from wires exceed negative, there are so far no results to indicate whether this 
applies also in the case of these discharge channels. The fact that positive lightning 
currents tend to be greater than negative (Bruce & Golde 1941) indicates, as
Allibone (1941) pointed out recently, that the same applies in this case, and that the
higher positive lightning currents may result from the neutralization during the 
return stroke of the greater charge deposited around a positive channel by the larger 
corona currents. As, however, the velocity of positive return strokes has not yet been 
measured, there remains the possibility that it is higher than that of negative strokes, 
which would equally well explain the higher positive currents. It is to be hoped that 
photographs of positive flashes will ultimately be obtained by Schonland and his 
co-workers and settle this point. Meanwhile, it appears quite probable that the 
difference between the corona currents from positive and negative wires is due to 
the difficulty of getting free electrons out of the metal conductor in the latter case, 
a difficulty which will not arise when the conductor is a gas discharge channel, so that 
it is possible that the pronounced polarity effect will not be found in the latter case. 

The general agreement between the theoretical and observed values of the current 
in the leader stroke—the former tending to be high if anything—affords further 
evidence in favour of the low value of the thundercloud potential, in view of the 
dependence of the corona currents on the square of the potential. Since the values
of the latter hitherto accepted were about 100 times greater than those now deduced,
the leader stroke currents would on those values be some 10,000 times those actually observed.



7. Current from a pointed conductor

It is of interest to compare the result obtained in the last section on the initial
potential of the space surrounding the top of the Empire State Building at the start
of a flash, with results obtained in investigations of the currents from pointed conductors,
since this current should be, on the theory now put forward, that required
to yield a brush discharge current sufficient to cause the transition from glow to
arc conditions, i.e. a current of about 0.5 or 1 amp. Both Warburg (1899) and Zeleny
(1907, 1908, 1941) have found that the current from a point can be represented by
an expression of the form

$$I = cV(V - V_1), \qquad (8)$$

$V_1$ being the potential at which the discharge starts, and $c$ a constant. In the present
case this becomes simply

$$I = cV^2.$$

Since for $V = V_0 = 2 \times 10^6$ V, $I$ should be equal to approximately 1 amp., $c$ in this
case is equal to $2.5 \times 10^{-13}$, a value agreeing very well with those found by the above-mentioned
investigators, Warburg’s values being $1.48 \times 10^{-13}$ and $4.83 \times 10^{-13}$,
while Zeleny’s values lie between $2.6 \times 10^{-13}$ and $4.2 \times 10^{-13}$. It would thus appear
on the present data that both the corona constant and this constant on which the
discharge from a point depends, can be extrapolated through a range of about
1000 to 1 in potential.
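The value of $c$ follows directly from $I = cV^2$. The short sketch below repeats the calculation for both of the transition currents mentioned (0.5 and 1 amp.) and sets the results beside the laboratory ranges quoted in the text; the function name and the tabulation are introduced here for illustration.

```python
def point_discharge_constant(current_amp: float, potential_v: float) -> float:
    """Constant c in I = c V^2, the onset potential being neglected."""
    return current_amp / potential_v**2


# Laboratory ranges as quoted in the text (lower and upper values of c).
LABORATORY = {"Warburg": (1.48e-13, 4.83e-13), "Zeleny": (2.6e-13, 4.2e-13)}

for current in (0.5, 1.0):
    c = point_discharge_constant(current, 2.0e6)
    print(f"I = {current} amp. at 2,000,000 V gives c = {c:.3g}")
    for name, (low, high) in LABORATORY.items():
        print(f"   {name}: c/low = {c / low:.2f}, c/high = {c / high:.2f}")
```

For 1 amp. the constant is $2.5 \times 10^{-13}$, within Warburg's range and just below Zeleny's; for 0.5 amp. it is half as great, so that the agreement is one of order of magnitude, as the text claims.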


8. Length of the first leader stroke darts

It was seen in § 6 that the potential difference between the top of the Empire
State Building and the surrounding space at the start of a lightning flash from the
Building is probably about 2,000,000 V, so that the lateral corona current from each
centimetre length of the discharge channel, $kV_0^2$, is initially about $1.6 \times 10^{-4}$ amp.
For this current to build up to 0.5 or 1 amp. a length of 3000-6000 cm., or 100-200 ft.,
is required. By the time the discharge reaches the bottom of the cloud,
which may be taken as at about 1.25 km. (Simpson & Scrase 1937), $V$ will have
increased to about 5.6 million volts,* so that the length of the darts will be reduced
to about 400-800 cm., or 13-26 ft. We should thus expect the dart lengths observed
by McEachron to lie between about 15 and 150 ft. The limits quoted by him are 19
and 60 ft., in quite good agreement with the theoretical values.
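The dart lengths above follow from dividing the transition current by the lateral corona current per centimetre, $kV^2$. A minimal sketch is given below, assuming $k = 4 \times 10^{-17}$ as adopted in § 5 and treating the channel potential as constant along a single dart; the function name is hypothetical.

```python
CM_PER_FT = 30.48


def dart_length_cm(transition_current_amp: float, k: float, potential_v: float) -> float:
    """Length of channel whose total lateral corona current equals the transition current."""
    return transition_current_amp / (k * potential_v**2)


K = 4.0e-17  # assumed corona constant, amp./(cm. V^2)
for potential in (2.0e6, 5.6e6):        # at the Building top and near the cloud base
    for current in (0.5, 1.0):          # transition currents named in the text
        length = dart_length_cm(current, K, potential)
        print(f"V = {potential:.1e} V, I = {current} amp.: "
              f"{length:.0f} cm. = {length / CM_PER_FT:.0f} ft.")
```

At 2,000,000 V this gives about 3100 and 6200 cm., and at 5.6 million volts about 400 and 800 cm., as in the text.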

The comparison cannot be made precisely in the case of Schonland’s observations
since data on the initial fields are lacking, but indicates that the latter must be less
than 100 V/cm.

* This potential of the channel relative to the surrounding space will build up at the rate of
$X_1 - X_a = 55 - 10 = 45$ V/cm.

9. Reillumination of the darts

As has already been mentioned, once arc conditions have been initiated in a discharge
channel owing to the increase of the current therein beyond a certain value,
they are apparently maintained when the current is reduced down to a value of
the order of one-tenth of that required to cause glow to arc transition in the 
first place. Again, the current which flows at any point in the leader stroke 
channel is that required to maintain the corona currents from the channel beyond 
the point in question. It follows from these two considerations that, after transition 
to arc conditions has occurred throughout the whole of the newly formed length of 
pilot streamer, the currents flowing in the last tenth of this length will be insufficient 
to maintain arc conditions therein, and retransition back to glow conductivity will 
occur. Consequently, when the new step is formed, about one-tenth of the old step 
will be reilluminated at the same time. It is therefore interesting to note that early 
in their investigation Schonland and his co-workers (1935) gave the following as the 
second of nine characteristics of the darts: ‘A bright step generally starts a little
way back on part of the track formed by the previous step, so that the portion of the
step which is entirely new is about 90 % of the whole...’, exactly as the theory
leads us to expect.


10. The long spark

The long laboratory spark will, it is hoped, be dealt with on the present lines in 
the near future, but several of its outstanding characteristics may be mentioned as 
being in accord with the theory now advanced. In the first place, the currents in 
its leader stroke are of the order of 0.5-1 amp. (Tamm 1928; Allibone & Meek 1938;
Komelkov 1940), as required if transition from glow to arc conditions were the 
criterion governing its initiation and progression. 

Again, in some of Allibone and Meek’s experiments, not only has the gaseous 
conductor gone over to the arc regime, but the conditions required for arc conductivity
have in addition been established at the electrode surface, which is seen
to be glowing in their figures 1 (d), 8 (a), and in others of their photographs, during
the leader stroke. The corona discharge from the leader stroke channel is likewise 
apparent, and is repeatedly referred to by the authors as a ‘shower of discharge’. 

11. Types α and β leader strokes

Schonland and his co-workers (1938) have found that in about one-third of the 
lightning flashes which they photographed the velocity of the leader stroke decreased 
during the second half of its travel from cloud to earth, and have attributed the 
higher values just below the cloud to the existence there of a volume of space 
charge. 

The present consideration of the corona currents from the discharge channel 
suggests, however, that the cause of the slowing down may be the inability of the 
discharge processes in the thundercloud to supply the charge required to maintain 
these rapidly increasing currents. Indeed many flashes never reach the earth at all, 
indicating that the cloud charge may actually be exhausted in supplying these 
currents. 




12. The recombination theory of the stepped leader stroke 

Certain criticisms of the recombination theory of the stepped nature of the first 
leader stroke have already been offered (§2). It is, further, difficult to understand 
why the conductivity in the channel should decay, when once the latter has been 
formed, if, as the theory assumes (Meek 1939), a field of the order of that required 
for breakdown is available to restart the conduction processes after this assumed 
decay. It might perhaps be argued that the charge in the neighbourhood of the 
channel in the cloud becomes exhausted, but this explanation would fail in the case 
of the upward leaders from the Empire State Building, in which the steps occur in 
exactly the same manner as they do in the downward leaders from the cloud. 

The theory would also appear to leave the bright ends of the darts unexplained, 
as well as the reillumination of the last tenth of the preceding dart. There is also the 
important consideration that in no case has the return stroke been observed to find 
any difficulty in progressing along the whole leader stroke channel at a very high 
velocity. It would be a strange coincidence if no case had been photographed in 
which the leader stroke met the earth when recombination had progressed far 
towards the exhaustion of conductivity. 

13. Conclusions 

(a) The low values of the electric field required to cause a lightning flash, of the
order of one or a few hundred volts per centimetre, put forward in an earlier paper,
are confirmed.

(b) Corona currents from the discharge channel are shown to account for the
currents observed in the leader strokes of lightning flashes from the Empire State 
Building, and for the records of the electrostatic field changes caused by the leader 
strokes of flashes to earth. 

(c) The glow to arc transition theory of the darts in the first leader stroke of a
lightning flash is shown to be in accord with the observed characteristics of the 
darts, whereas the recombination theory is not considered to be. 

(d) Flashes probably occur to the Empire State Building when the average field 
between cloud and earth is only of the order of 50 V/cm. 

(e) The laboratory values of the corona constant and the constant on which the
discharge from a point depends would appear to allow of extrapolation through a 
range of 1000 to 1 in potential. 

(f) The potential of the thundercloud charge can be determined from records of
the electrostatic field change caused by the leader stroke, together with a measure 
of the height of the charge, and hence the variation of the leader stroke velocity 
with the initial potential gradient, and the energy in individual flashes can likewise 
be determined. 

(g) The zone of protection afforded by a lightning conductor may be expected to 
increase with the current in a lightning flash. 





References 


Allibone, T. E. 1941 Quart. J. R. Met. Soc. 67, [?]27.

Allibone, T. E. & Meek, J. M. 1938 Proc. Roy. Soc. A, 166.

Bruce, C. E. R. & Golde, R. H. 1941 J. Instn Elect. Engrs, 88, 487.

Fan, H. Y. 1939 Phys. Rev. 55, 769.

Goodlet, B. L. 1937 J. Instn Elect. Engrs, 81, 487. 

mo 

145. 

Larmor, J. & Larmor, J. S. B. 1914 Proc. Roy. Soc. A, 90, 312. 

Larsen, A. 1905 Ann. Astrophys. Obs. Smithson. Instn, p. 119.

Loeb, L. B. 1939 Fundamental processes of electrical discharge in gases: (a) p. [?]; (b) p. [?].
London: Chapman and Hall Ltd.

Mathias, E. 1924 Traité d'Électricité Atmosphérique et Tellurique. Paris: Les Presses Universitaires
de France.

Matthias, A. 1929 Elektrotech. Z. 50, 1469. 

Meek, J. M. 1939 Phys. Rev. 55, 972. 

Meek, J. M. 1941 J. Instn Elect. Engrs, 88, 509.

Peek, F. W. 1915 Dielectric phenomena in high voltage engineering, p. [?]. New York.

Pittman, R. R. & Torok, J. J. 1931 Trans. Amer. Inst. Elect. Engrs, 50, 568.

Schonland, B. F. J. 1938 Proc. Roy. Soc. A, 164, 132.

Schonland, B. F. J. 1939 Terrestrial magnetism and electricity, p. 663, ed. J. A. Fleming.
New York and London.

Schonland, B. F. J. & Collens, H. 1934 Proc. Roy. Soc. A, 143, 654.

Schonland, B. F. J., Hodges, D. B. & Collens, H. 1938 Proc. Roy. Soc. A, 166, 56.

Schonland, B. F. J., Malan, D. J. & Collens, H. 1935 Proc. Roy. Soc. A, 15[?], 6[?].

Simpson, G. C. 1929 J. Instn Elect. Engrs, 67, 1269. 

Simpson, G. C. & Scrase, F. J. 1937 Proc. Roy. Soc. A, 161, 309.

Simpson, G. C. & Robinson, G. D. 1941 Proc. Roy. Soc. A, 177, 281. 

Tamm, R. 1928 Arch. Elektrotech. 19, 235. 

Todd, F. C. & Browne, T. E. 1930 Phys. Rev. 36, 732. 

Warburg, E. 1899 Ann. Phys., Lpz., 67, 72. 

Watson, E. A. 1910 Electrician, 64, 707.

Whitehead, S. 1927 Dielectric phenomena: electrical discharges in gases, p. 135. London:
Ernest Benn Ltd.

Wilson, C. T. R. 1920 Phil. Trans. A, 220, 73.

Wormell, T. W. 1939 Phil. Trans. A, 238, 249.

Zeleny, J. 1907 Phys. Rev. 25, 305.

Zeleny, J. 1908 Phys. Rev. 26, 129.

Zeleny, J. 1941 J. Franklin Inst. 232, 23.



Address of the President 
Sir Henry Dale, O.M., G.B.E., at the
Anniversary Meeting, 30 November 1944 

The annual number of Obituary Notices of Fellows of the Royal Society published 
today, and the names which have just been read to us, remind us of the losses the 
Society has suffered. 

Allow me first to make brief mention of the last service rendered to the Society 
by one who had long been devoted to its interests, and whose name is among those 
of the Fellows whom death has taken from us during the past year. Sir Henry 
Lyons, who was our Foreign Secretary for a year, and then achieved so much for 
the Society in his full term of service as Treasurer, had acquired in that period 
a deep interest in the handling of the Society’s business and in the changes in its 
structure and its administration over the centuries of its history. He devoted the 
last four years of his life, under conditions which must have deterred any less 
resolute enthusiast, to the writing of an historical account of the administration 
of the Royal Society under its Charters, and at the time of his death, last August, 
he was eagerly awaiting its publication, which war-time difficulties had long delayed. 
This long expected contribution to our history was published a few weeks ago, 
and the Society will welcome and cherish it, not only as a record of value and 
interest in itself, but in memory of one to whose devoted labours the Society and 
its Fellows owe so much. 

Then let me say, on behalf of all the Fellows, a word of farewell and grateful 
acknowledgment to a member of our permanent staff who, after long and efficient 
service, is retiring from full duty. Mr Ronald Winckworth has for nearly 20 years, 
with only a short interruption, acted as assistant editor of our publications, and 
as the courteous helper and friend of all who have contributed to them. It may 
be doubted whether anyone else for so long a term has read, with a critical 
attention, every word and symbol of Philosophical Transactions and Proceedings, 
both A and B, and survived to accept the applause which such a feat deserves. 
Though our friend, Mr Winckworth, can no longer accept the full burden of that 
vigilant responsibility, you will be glad to know that we shall still be able to call
upon his experience for less frequent consultation. It is a matter for satisfaction, 
too, to think that science will not be without compensation for its loss over the 
wide range of our publications, since Mr Winckworth will have a greater freedom 
to concentrate his interest on his own special study of the Molluscs, on which he
is one of our leading experts. He may be assured that he takes with him the
gratitude and good wishes of us all.




Vol. 183. A. (22 February 1945)





Anniversary Address by Sir Henry Dale 

A year ago I reported to the Society that our Biological Secretary, Professor 
A. V. Hill, had left us on an important mission to India. The Council of the Society, 
at the invitation of the Indian Government, had nominated Professor Hill to visit 
India, to see its problems for himself, so that he might offer his advice on scientific
matters in general and, in particular, on the adoption for India of a new and
progressive programme of research and enterprise in science and its applications.
From all sources—from the Viceroy and the Secretary of State with their official 
colleagues and counsellors, and directly from our own colleagues, the Indian men 
of science—we have had evidence of the unqualified success of this mission of
co-operation and good will. We had nominated Professor Hill with a full confidence
in his personal and scientific qualifications for such an undertaking. We knew, 
however, that the success of his mission would depend even more on the response 
made to his appeal by India’s men of science. To succeed, he must find them sharing 
his own conviction, that only a large and generous promotion of scientific and 
technological development in India would open for the teeming millions of its 
peoples any prospect of the advances in nutrition, health, prosperity and culture 
required to fit them for their proper place in such a world civilization as we shall soon 
be striving to rebuild on firmer scientific foundations. The response of India’s men 
of science has been no more in doubt than that of its Government. We sent Professor 
Hill to hold out the right hand of co-operation; India has sent us six of its scientific 
leaders to grasp it here in Britain, and we rejoice to have them with us to-day. 

In accordance with a plan which I mentioned here a year ago, the first meeting 
of the Society to be held outside this country, in all its long history, duly took place 
on 3 January of this year, on the occasion of the meeting of the Indian Science 
Congress in Delhi. The Viceroy, Lord Wavell, honoured the meeting with his 
presence, and Professor Hill, as a Vice-President, was able to give formal admission 
to two of our Indian Fellows, Professor Bhabha and Sir Shanti Bhatnagar, and to 
receive their signatures on a special sheet of parchment. India’s prompt and 
generous response to the invitation which Professor Hill extended, has now brought 
Sir Shanti Bhatnagar here as a member of the return mission, so that he has been 
able already to sign the Charter Book itself. We shall hope that opportunities will 
yet occur for all Indian men of science who have been, or will in the future be, 
elected to our Fellowship, to come here to the Society’s home and to inscribe their 
names in the book which our Fellows have signed since 1662. This, our ancient
and now historic Charter Book, is largely representative, in its earlier pages, of
the springtime of modern science, not only in this country, but in all the western 
world; and we desire that it shall now also bear witness to a growing freedom of 
scientific contact and interchange with the great centre of eastern culture, from 
which our visitors have come. Their journey to Britain, and later to Canada and 
the United States of America, should mark the beginning of a new era of
collaboration in science between India and the rest of the world, growing ever easier
and more intimate as the means of world transport, with the added impetus which 
war’s demands have given, attain yet higher levels of speed and safety. 




Our Indian colleagues are coming to the end of a visit which, we hope, has given 
them a new insight into the scientific activities and organization with which this 
country is still meeting the demands of war, and preparing for the tasks which will 
later have to be shouldered by a tired though triumphant country, in a largely 
devastated world. We hope that they will have learned, perhaps from our failures 
as well as from our achievements, lessons which will have applications of real value 
to their own country’s scientific problems. All Fellows of the Royal Society have 
been glad to know that our Apartments have served our Indian guests as an official 
home and a point of departure for their various visits and engagements, and that
Professor Hill has thus been able to keep a close and friendly eye upon this
gratifying fulfilment of his hopes and his plans. Two days ago this visit of our Indian
colleagues, nearing its end, reached also its climax, when Their Majesties the
King and Queen were graciously pleased to receive them, and thus to show
their interest in the promise of a closer understanding and comradeship in 
science between India, the United Kingdom, and the whole of the British Empire. 

I made preliminary mention last year of the action taken by the Council, in
appointing a special Committee to consider the prospective needs of fundamental 
researches in Physics. As was then foreseen, the news of this response to an appeal 
from the sponsors of Physics soon evoked similar pleas from those concerned for 
such researches in a number of other branches of science. The Council agreed, 
accordingly, to appoint a series of other committees to advise upon the respective 
requirements, in the period following the war, of fundamental researches in
Chemistry, Biology, Geology, Geophysics, Geography and Meteorology. Reports from
all these Committees have been received and considered, together with one of 
similar aim presented by the standing Sub-Committee, on Oceanography, of the 
National Committee for Geodesy and Geophysics; and it is by no means certain that 
the tale of these special reports and appeals is entirely complete even now. It will be 
understood that the Society, in undertaking this investigation, has endeavoured to 
centre its enquiry on researches aiming at the advancement of knowledge without 
immediate or even implicit reference to practical needs or objectives. It has done 
this, not because it regards those researches, which have in view the use of 
scientific methods and principles for the service of man’s material needs, as of 
inferior status or interest, or as being, by their nature, less worthy of the Society’s 
consideration. We do not need to be reminded that researches seeking only the 
advancement of pure science have often revealed the shortest ways to practical 
developments, or that those undertaken with practical ends may chance to 
open windows upon new vistas of knowledge. It may be hoped that the Royal 
Society will never lose the comprehensive aim defined in its Charter, of ‘further
promoting by the authority of experiments the sciences of natural things and of 
useful arts’. It will be understood, however, that the Council had to keep in mind 
the increasing extent to which support by public money is already given, through 
the three Advisory Councils, to those researches which are aimed, more or less 
directly, at obtaining and using scientific knowledge for the promotion of industrial
and material progress, of human health and of agriculture; and they had to keep 
also in view the more immediate appeal which is made to private benevolence 
and enterprise, by researches with objectives thus already in view. As I said 
last year, the Royal Society must, in such circumstances, regard as its special 
concern the proper nourishment of the roots of the tree of knowledge. We hope 
that the report which the Council have now adopted, based upon those of the 
special Committees, will enable the Society to evoke an effective response to its 
own conviction of the supreme importance, for the nation’s future, of provision 
for the support of scientific researches at the frontiers of the unknown, on a scale 
more generous than any which has hitherto come within the range of official vision. 
The Society is making representations to the appropriate Departments and 
Authorities, and stating its conclusions as to the nature and the dimensions of the 
various grants of funds and facilities which will be needed, to enable this country 
to keep its place in the general advance of science, in the altered world which will 
emerge from the war. 

I ought to make further brief reference to another matter on which I spoke 
in some detail last year, namely the accommodation provided by the nation for 
those scientific societies which, according to the pattern and tradition of our own, 
devote their activities to the promotion of natural knowledge for the general 
advantage, without reference to private interests, or to professional status and 
discipline. Such provision had been accepted as a national obligation as early as 
1778. The quarters at present provided, here in Burlington House, represent the 
allotment made in 1869 to the societies which then achieved such recognition. Since 
that date no enlargement or material alteration has been made, and there seemed to 
be no prospect of any, either for the societies admitted 75 years ago, or for others 
whose claims to such privilege have later appeared. Even the accommodation thus 
assigned in 1869 appears to have been only a remainder, after the major part of 
an earlier and more generous plan for science had been diverted to other purposes 
by intervening claims. In 1869, we should remember, the study and promotion of 
the sciences had hardly been recognized as a life’s occupation, except for a few 
determined devotees; science was at that date still largely a matter for the leisure
of cultured amateurs. It is no matter for surprise, therefore, that accommodation 
then regarded as acceptable should now be found to make grossly inadequate 
provision for some of the more important of present scientific needs, and none at 
all for others. 

After I had drawn public attention to this anomaly last year, we were asked, by 
several of the specialist societies concerned, to take appropriate action with the 
Government. We approached the Lord President of the Council, Mr Attlee, and on 
13 October, with the Chancellor of the Exchequer, Sir John Anderson, and the
Minister of Works and Buildings, Lord Portal, he kindly received a deputation 
representing all the Societies which have been housed here since 1869, and several 
others for which the Government has hitherto made no provision. Apart from the 
special and urgent needs of some of the societies, the case was fully presented for
the inclusion, in any scheme for the rebuilding of London, of a centre adequate to 
accommodate the principal scientific societies on a well co-ordinated plan, and so
placed and designed as worthily to symbolize the central importance of British 
science, for the cultural as well as the material aspects of the civilization of this 
nation, and, indeed, of the world. The Ministers gave a careful and sympathetic 
hearing to the statement of our claims, and asked us to furnish quantitative data, 
as a basis for their further consideration of the problem. 

With the outcome of the war even more certain, there can be no relaxation yet 
of the demand on what our scientific effort can contribute to the hastening of its
end. It is none the less our duty to begin to look further ahead and to prepare for 
the part which science must play in the world which will follow. The needs of 
alliance in war have evoked, especially between the two great branches of the 
English-speaking nations, a closer interchange and collaboration in science, 
between men of different national traditions and loyalties, than has ever before 
been a matter of organized policy. It is not too early to begin to consider to what
degree, and in what form, such a collaborative effort should be continued into the
conditions of peace, and extended to scientists of international good will
throughout the world. Even in the twenty years of uneasy armistice which ended in 1939,
a measure of co-operation among the world’s scientists was achieved, which held 
promise as a potential agent of international understanding. Our traditions go 
back to days when our Fellows belonged to a community embracing all Europe in 
its enthusiasm for the new experimental philosophy, and the Royal Society will 
have a particular duty to be among the leaders in the resumption of international 
activities in science, and to use all its influence to establish these on an ever 
wider and firmer basis. We have a standing Committee on international relations 
in science, with our Foreign Secretary appropriately as its Chairman, to
prepare for what action the Society can usefully undertake or promote, as the
opportunity presents itself. Meanwhile, we may observe other signs that the spirit 
of international friendship and recognition in science is beginning to move again, 
even while the chaos of war is still with us. Among such signs, we may note that 
one of the Committees which, since their foundation in Stockholm, have awarded the 
Nobel Prizes with unchallenged impartiality among the scientific discoverers of all 
nations, has resumed its awards this year. We in this Society have welcomed the 
return to London of a group of distinguished French leaders in science from the 
United States of America, whither they had escaped from the hostile occupation 
of their country. We have been able to share their rejoicing at the liberation of 
France, and to welcome here others who had remained there, often in hiding and 
always in peril, as leaders in the steadfast resistance opposed by all but a negligible 
minority of the French men of science to the enemy’s demands for their
collaboration. Even today we are able to welcome another distinguished French colleague
just arrived from Paris, Professor Émile Borel. The Society had the pleasure of
providing a special opportunity for our Fellows to meet our French colleagues, 
with those from other allied countries who have long been in exile here during
enemy occupation of their homelands. We are glad to think that new and lasting 
bonds of comradeship in science have been created for us with those who have 
been our country’s wartime guests, and through them with all the men of science 
in the countries which they represent. A happy chance brings also to our meeting 
today four men of science from Soviet Russia. Very near to the heart of every 
British scientist is the desire for a growing intimacy of confidence and
collaboration with our colleagues of that great partner-nation in the war for the world’s
freedom.

The Society had recently the opportunity of showing its interest in the revival
of international scientific co-operation in another special connexion. From 16 to 
19 October a small International Conference met here, under the auspices of the 
Health Organization of the League of Nations, to discuss the creation of an
international standard of reference for Penicillin and the definition, in terms of this, of
a unit of activity. Though the League has failed tragically of its central purpose,
it has less conspicuous achievements to its credit, and science has an interest in
ensuring the permanence of some of these. I have myself had the privilege of taking
part in the activities of an International Commission under the Health
Organization of the League, which succeeded, in the years between the wars, in obtaining
world-wide acceptance of standards and units of activity for a whole range of 
modern remedies—antitoxins, hormones, vitamins and certain drugs—the strength 
of which could only be determined by direct biological measurements of the specific 
activity, in comparison with that of a fixed standard preparation in each case. 
Insulin was an early instance of a new remedy requiring such intervention; its 
general use for the treatment of diabetes could not have attained the present level 
of safety and effectiveness, unless a world-wide uniformity on these lines had 
replaced the chaos of widely different units in different countries, which was 
threatened in 1923. And now research has produced another new remedy,
Penicillin, the success of which, in the treatment of a range of dangerous infections, has
also had such a dramatic quality, that its reputation has spread rapidly beyond
our scientific community and caught the interest even of a war-distracted world.
Here, indeed, was a discovery which could rank as a major contribution of
science to the mitigation of the suffering which war inflicts, and, at the same
time and no less, as a gift of healing to mankind at peace. The needs of war
had given a stimulus to the researches which proved Penicillin’s remedial value,
but its rapid production on an adequate scale had to face greater difficulties
in our own country, where material and human resources had been more
completely absorbed by earlier requisitions, than in the United States. So the present
position was reached, in which, as we are proud to recognize, the existence of
Penicillin and then, after a decade, the methods by which it could be separated in
sufficient purity to demonstrate its brilliant possibilities as a remedy, were
discoveries made here in England, while, in the further researches and technical
developments needed for its large-scale production, our American colleagues have 
played a major part. 




Thus early in its history, therefore, Penicillin and its applications had become 
a matter of international concern; and, though war had restricted the work in 
this field almost entirely to scientists of the English-speaking peoples, and had 
brought them into an unusual intimacy of co-operation, progress had been 
so rapid and action so urgent, that there was a real danger of a divergence of 
meaning in the terms used to express its activity and define its dosage, even among
the few countries already using it. Prompt action was required to avert this by 
accepting a common standard of reference; and, when the proposal of a Conference 
for this purpose was made from this country to the League of Nations Health 
Section, we were grateful to our colleagues from the United States, as well as from 
Canada and Australia, for the generous promptitude with which they agreed to 
make the journey to England, so as to meet with us here in London. After all
arrangements for the holding of the Conference here had been completed, the liberation of
Paris opened a new possibility; Dr Trefouel, now Director of the Pasteur Institute 
in Paris, was able at the last moment to accept an invitation to join us, and thus 
to give our deliberations, and our eventual agreement, a wider international basis. 
To illustrate how rapidly a divergence may arise under present conditions, I may 
just mention the fact that several different Penicillins have now been
recognized, produced by variations in the metabolism of the growth, possibly due to
mutations of the mould itself, possibly to changes in the nutritive conditions 
offered to it by the medium or the cultural method employed. Three such varieties 
of Penicillin have already been isolated in pure condition, and distinguished by 
certain chemical characters; but, while English workers had come to refer to 
these as Penicillin 1, 2, and 3, their colleagues in America spoke of Penicillin F, G, 
and X; and it was not until they met round the table in our Council Room a few 
weeks ago, but then in less than 10 minutes, that they became quite certain of the 
identity of 1 with F, of 2 with G, and of 3 with X. All these Penicillins have 
the specific remedial action in high, though not quite identical degrees, and there 
are probably differences, still to be explored, in their proportional efficiencies
against different infective organisms. When once their identities were thus 
put beyond doubt, however, the small Conference had no hesitation in deciding, 
for the present, to use as the common basis of reference a sample of the Penicillin 
which is predominant in most preparations now available, and most easily obtained 
as a pure salt in adequate quantities. The unit could then be defined as the activity 
of a precise, though very small weight—0.6 μg—of a particular sample of the
perfectly dried, crystalline sodium salt of Penicillin 2, or G; and the unit thus 
chosen for definite fixation, and for international recognition henceforward, was, 
by a unanimous choice, so defined as to be as closely equivalent as possible to the 
unit first propounded by Sir Howard Florey's team of collaborators, and widely 
known as the 'Oxford' unit. 

The International Standard for Penicillin is thus added to an already numerous 
series, of which the custody, on behalf of the League of Nations Health
Organization, has been shared by our own National Institute for Medical Research with the
State Serum Institute of Denmark, at Copenhagen; and all of these standards, we 
may hope, will be available for transfer to whatever international authority may
be established in succession to the League, as a tangible and material result of 
genuinely international collaboration, which the League has been able to initiate 
and maintain among men of science, to the permanent advantage of the world. 

Though Penicillin has rightly made a special appeal to the imagination and
sympathetic interest of a wide public, it is, of course, only one out of a varied 
range of inventions and discoveries, hastened by the stimulus of war’s demands 
and produced, in many cases, behind the veil of its secrecy, but ready, when peace 
returns, to take their proper place as new gifts to the welfare and the civilized 
progress of mankind. From what has already been made generally known, it is 
clear that we may look forward to revolutionary advances in the means of
communication and in the speed and safety of travel across the world, and in methods
of controlling insect pests and the diseases which insects convey; and these are but 
a few examples of the gains which we and the world may hope to set against the 
tragic loss and sacrifice of the years of war. There were probably few who even 
suspected in 1939, that science, in countries then so dangerously unready, would 
find itself, before the war ended, in its present position of central importance. 
None of us, I think, would claim more for science even now, than to have played 
in this war a part of growing predominance in the provision for the fighting men 
of the material means of warfare, without which their heroism and sacrifice could 
not have prevailed. Even that duty, loyally accepted, is one from which the 
scientific community of the free nations must long for the release which victory 
will bring. But, while the operations of war have come to depend on science to 
a degree beyond all earlier experience, it cannot be doubted that little more than 
a beginning has yet been made in exploiting the possibilities of destruction, which 
science could progressively offer, if the world should continue thus to misuse it, 
and if science were still on offer for such ends. Allow me to quote a passage from 
a letter which the Prime Minister, whom we are proud to number among our 
Fellows, wrote a year ago to Professor Hill, in sending his greetings to Indian 
men of science. 

‘It is the great tragedy of our time’, wrote Mr Churchill, ‘that the fruits of
science should by a monstrous perversion have been turned on so vast a scale to 
evil ends. But that is no fault of science. Science has given to this generation the 
means of unlimited disaster or of unlimited progress. When this war is won we 
shall have averted disaster. There will remain the greater task of directing
knowledge lastingly towards the purposes of peace and human good.’ Noble words indeed,
and a profession of faith which will find an immediate echo in the hope and the 
desire of every true man of science. ‘When this war is over we shall have averted
disaster’—surely that is a confidence which every one of us will long to share. 
It must be clear, however, that Mr Churchill’s reference was to the present threat 
of disaster, from which the prospect of our escape is even more fully assured to-day 
than when he wrote, a year ago. We may be certain that nobody sees more clearly
than he, that the threat of final disaster to all man’s hopes and achievements will 
not be forever averted, if the possibility of the ‘monstrous perversion’ of science 
is allowed to remain and to continue its evil growth. Even in the past year our 
enemies have thrown a new and vivid light on future possibilities, by the new 
weapons which science has enabled them to put on trial for our destruction. Though 
a people’s unflinching courage and an answering effort of science and
organization, together with the progress of the allied armies over the launching areas, have
given us confidence that flying bombs and the like will not affect the issue of 
this war, the warning which they give, as to what the future might hold, is not 
the less clear. The writing on the wall must be plain for all to read. If, when
the memories of the present war begin to fade, the world should allow science 
again to be exploited by a nation grasping at predominance by conquest, science 
will no longer be invoked only as an aid to what valour can achieve by land, sea 
or air, but as an agent, in itself, of blind annihilation at an ever lengthening range. 
When we men of science regain that freedom, for the ultimate preservation of 
which we have loyally accepted, through these tragic years, the bonds of secrecy 
and submission to authority, we cannot put aside with these our proper share in the 
new responsibility for the future of mankind, which this war’s experiences have 
laid upon the men of good will in all nations. It is true, indeed, that neither 
the present abuse of science, nor any possibility of final disaster to civilization, 
which might come of a future perversion of its powers, can be charged as a fault 
to science itself; no more, indeed, than we could properly charge to religion, as 
such, the wars which once devastated much of Europe in its name. But we men of 
science cannot escape from our growing share in the responsibility, in ‘the greater
task’, as Mr Churchill has written, ‘of directing knowledge lastingly towards the 
purposes of peace and human good’. No man of science has the right to prescribe 
for another his interpretation in detail of that duty; but there is one aim which 
may unite us, perhaps for the most effective action within our common grasp, 
and one which is worthy of all our common influence and effort. Let me quote 
again from Mr Churchill’s letter: ‘in this task’, he writes, ‘the scientists of the 
world, united by the bond of a single purpose which overrides all bounds of race 
and language, can play a leading and inspiring part’. To build anew, and on a
firm and broadening foundation, a world community in science, is surely an aim 
worthy of our utmost effort and devotion; but there can be no swerving from the 
present duty, and the call on science by war may yet be sterner, before we have 
won the freedom thus to work for the future of the world. 





Awards of Medals, 1944

The Copley Medal is awarded to Sir Geoffrey Ingram Taylor in recognition 
of his contributions to knowledge of aerodynamics, of hydrodynamics and of the 
structure of metals, which have had a profound influence on the advance of 
physical science and of its practical applications. 

Geoffrey Taylor is probably the most accomplished living exponent of the 
application of the methods of classical dynamics to problems of fluid motion. To 
great mathematical powers he adds high skill as an experimenter. His theoretical 
work is particularly noteworthy for its approach to reality. In place of the ideal
conceptions presented by perfect incompressible fluids moving in stream-line 
motions and perfectly elastic solids, with which his great forerunners at Cambridge 
dealt, Taylor has studied turbulent motion, viscous and compressible fluids, and 
plastic movements of metals, obtaining results of great importance for the
understanding of a wide range of phenomena.

His early work was concerned with eddy motion in the atmosphere, and opened 
up new fields of meteorological investigation. It threw light on the variation of wind 
with height and on the transference of heat and water vapour in the atmosphere, 
with a consequent bearing on the formation of fog. He also carried out work on 
the tides. Later he developed the theory of general turbulence, to which statistical 
methods can be applied which are somewhat reminiscent of the kinetic theory 
of gases. 

Among Taylor’s extensive researches on precise hydrodynamical problems, that 
on the motion of a viscous fluid between two coaxial cylinders, rotating with any 
speed in the same or in opposite directions, may be particularly mentioned, since 
it offers the only case so far of the complete solution of a problem of motional 
instability in the viscous liquid. In dealing with the plastic deformation of metals
Taylor has shown how the slip planes can be determined in certain cases by purely 
geometrical methods, and has offered a formal theory of the process of
work-hardening in single crystals.

Taylor has also applied his great mathematical powers to a variety of practical 
questions. In the last war he did work of great distinction on aerodynamical 
problems for the Advisory Committee on Aeronautics, and in the present war he 
has been extensively concerned with complicated problems concerning the
propagation of explosive processes. Taylor’s work may be said to be in the line of
a great British tradition, which, in the past generation, was represented by 
investigators like W. J. M. Rankine, Osborne Reynolds and Rayleigh. Like these 
he has the mathematical equipment, the originality and the insight required 
for the fundamental solution of problems presented by practical experience 
in the laboratory, in the workshop, and in the wider world. Taylor has carried 
his quest for experience and for scientific problems on to the sea and into the air. 
He is a practical yachtsman and mariner, with a new design of anchors to his credit,
and an aviator who, in the last war, made pioneer descents from aeroplanes by 
parachute. His work during this war has been of the greatest value to the nation 
and its allies, and his fundamental discoveries are extending the boundaries of 
knowledge for all mankind. 


The Rumford Medal is awarded to Dr Harry Ralph Ricardo in recognition
of his important researches on the internal combustion engine. 

There is a special fitness at the present time in the award of this Medal to one 
who, during the last twenty years, has been the leading spirit in the development 
of the high-speed internal combustion engine. Ricardo’s researches were begun 
under Bertram Hopkinson in 1905 and continued, after he left Cambridge, as a 
consulting engineer in his grandfather’s firm. Investigating the effect of turbulence 
on the speed of combustion, he was led to appreciate the importance of ‘knocking’, 
to determine its cause and to show that the tendency to ‘knock’ was dependent 
on the nature of the fuel. Taking charge of a special design department for his 
firm, he produced a four-cycle, supercharged aero-engine, long in advance of 
accepted practice. In 1916 he was invited to plan a special engine for the secret 
fighting machine which was to become known as the tank, and his unorthodox 
and daring design was an outstanding success. Forming a private company to 
maintain a laboratory for research on the internal combustion engine, Ricardo 
further investigated the relation of the phenomenon of ‘knocking’ to the maximum 
compression-ratio of the engine and to the character of the fuel, matching the 
latter by adding toluene in variable proportion to heptane, and thus paving the 
way for the modern octane-rating. It is not possible here to make more than 
general reference to the far-reaching influence of Ricardo’s investigations and his 
steady advocacy on the designs of slide-valve engines, of sleeve-valve aero-engines, 
of high-speed Diesel engines, and on other important developments in engine- 
design. In all directions there is evidence of his special genius and flair for design, 
and, behind this, of his full appreciation of the thermodynamical principles which 
control the behaviour of engines, and of a deep knowledge of the physical and 
chemical factors involved, as well as of the characters of fuels and of the materials 
of the working parts. 


A Royal Medal is awarded to Professor David Brunt in recognition of his 
fundamental contributions to meteorology. 

David Brunt has made fundamental contributions to meteorology in its statistical, 
dynamical and physical aspects. The subjects which he has treated include cycles 
in weather; atmospheric radiation; atmospheric turbulence; the dynamical causes 
of rainfall; instability and convection in their bearing on the forms of clouds and 
on soaring flight; and the dynamics of depressions and anticyclones. He has 



rendered an outstanding service to his subject by his book on Physical and
Dynamical Meteorology, in which he gives the first connected and critical account
of the physics and dynamics of the atmosphere, and reduces to order a large
amount of material which was previously available only in isolated papers. The
book contains much original matter, and has played a leading part in the recent 
development of meteorology in all countries. 

Brunt has been a pioneer in the analytical approach to his subject. Of recent 
years he has devoted several papers to the discussion of the factors which influence
bodily comfort, and has gone far to provide a natural basis for the classification 
of climates in relation to human health and human needs. 

For many years Brunt has conducted, at the Imperial College of Science and 
Technology, a flourishing school of meteorology, which has attracted students 
from all parts of the world. He has always been generous with his services to 
colleagues in other fields who have required expert meteorological assistance. 
During the war his wide knowledge and sound judgment on meteorological
questions has been of the utmost value to the cause of the nation and its allies. 


A Royal Medal is awarded to Dr Charles Robert Harington in recognition
of his work on the structure and synthesis of thyroxine, and on the chemical 
basis of immune reactions. 

Harington’s reputation, as a leader among biochemical investigators, was 
established by a brilliant series of researches dealing with the chemical nature, 
the origin, and the form of the natural combination of the thyroid hormone, 
thyroxine, with its remarkable content of iodine. He improved the method of 
isolating this active principle from the thyroid gland, determined its structural
constitution and then produced it by artificial synthesis. Later he demonstrated 
that diiodotyrosine was present in the gland and accounted for the balance of its 
iodine content. By enzymatic cleavage he proceeded to show that thyroxine and 
diiodotyrosine are natural amino-acid constituents of the complex thyreoglobulin. 
These discoveries gave an entirely new precision to knowledge of the thyroid 
hormone, of the manner of its natural occurrence and function, and of the diseases 
which result from excess or defect of its supply from the gland. In more recent
years Harington’s work has contributed very important advances to knowledge 
of the chemical basis of immunological specificity. By a new method of coupling 
haptene groups artificially to proteins, he has studied the role of carbohydrates 
and of tyrosine in developing antigenic properties. By such methods he has
created artificial antigens, the specificity of which is determined by the attachment
of physiologically active haptenes, such as thyroxine and acetylsalicylic acid. He
has thus produced and determined the limits of specificity of antisera reacting
with free thyroxine or acetylsalicylic acid, and has observed the antagonism of
such sera to the physiological effects of such principles in the animal body. In




other directions also Harington has made brilliant contributions to biochemical 
knowledge, as by his work on the conditions determining the crystallization of 
insulin and on the synthesis of glutathione. 


The Davy Medal is awarded to Sir Robert Robertson in recognition of his 
researches on explosives, analytical methods, the internal structure of the diamond 
and infra-red absorption spectra. 

After studying at St Andrews, Robertson made his first acquaintance with the 
field of explosives as a chemist at the Waltham Abbey Royal Gunpowder Factory, 
where he was occupied on the nitroglycerine plant. He acquired a knowledge of 
all aspects of the manufacture of cordite and contributed improvements, such as 
the acetone recovery process which has been widely adopted. His researches in 
this period covered calorimetric measurements and in particular the study of the
kinetics of the decomposition of gun cotton, which led to the publication of work 
of great practical importance on the stabilization of that material (1916). 

A succession of spontaneous explosions in cordite magazines led to a visit by 
Robertson to India and to the issue of an exhaustive and valuable report in 
February, 1907. Shortly afterwards he entered the Research Department at 
Woolwich as Superintending Chemist, later director of Explosives Research, and 
he occupied these posts with distinction until 1920. Much valuable work was done 
before and during the war of 1914-18. A process for the manufacture of T.N.T.
introduced novel features and prepared the way for the large T.N.T. factories
established under Lord Moulton. Other notable achievements by Robertson were
the introduction of cordite R.D.B., which relieved the acetone position, and of
amatol, of which it was said by the Director of Artillery that ‘amatol won the war’.
Throughout his time at the Woolwich Research Department, Robertson showed 
himself a resourceful investigator, an able leader and an indefatigable worker. 

There followed for Robertson a further period of effective organization and 
active enterprise in research, as Government Chemist. He was one of the first to 
recognize the importance of infra-red spectrography for the determination of 
molecular structure, and in his pioneering work on the infra-red spectra of ammonia 
and of arsine he pushed the accuracy of the instruments then available to their 
ultimate limits. These researches drew the attention of chemists to the possibilities
of the analysis of molecular vibrational and rotational bands, and materially 
assisted in opening up the wide field which has been explored in recent years. 
Robertson’s studies of the absorption spectra of diamonds have produced results 
of very great interest; they show that diamonds exist in two types differentiated 
by the condition of strain originating in their high-temperature formation, and 
by the mosaic character of the less usual type. 

During the present war Robertson has occupied very responsible positions in 
relation to the earlier field of his researches and has played a keen and active part 
in the contribution of chemistry to the national emergency.




The Darwin Medal is awarded to Dr John Stanley Gardiner in recognition 
of his life’s work on coral reefs.

Gardiner is universally recognized as an authority on coral reefs and on the 
organisms associated with such habitats. His contributions to these fields of 
biological and geographical research began not long after his graduation, when he 
was a member of the coral reef boring expedition to the Atoll of Funafuti, organized 
by the Royal Society in 1896. Since then he has himself organized and led two
most important expeditions, the first to the Maldive and Laccadive Archipelagoes
in 1899 and the second to the Indian Ocean in 1905; the results of these expeditions
are embodied in nine large quarto volumes and represent a most valuable contribution
to a field of knowledge closely associated with the work of Charles Darwin.
Within recent years Gardiner has organized and, to a large extent, directed the 
Cambridge Expedition to the Suez Canal, 1924; the Great Barrier Reef Expedition, 
1928-31; the John Murray Expedition to the Indian Ocean, 1933-34; and the 
Expedition to Lake Titicaca in 1937. He is an authority on the taxonomy
and systematics of Alcyonarian and Zoantharian Corals, and has taken a keen
interest in their ecology and geographical distribution. He realized the great 
importance, in the study of corals, of the examination of the polyps themselves, 
as well as of their coralla, and he paid special attention to variations which may 
result from slight differences of habitat and are correlated with physical and 
other conditions, showing that in several instances so-called ‘species’ are merely
variations. 

He also realized the immense value of an accurate knowledge of the coral 
fauna of any given locality in relation to its environment, in enabling one to
deduce the conditions under which tertiary and earlier coralline deposits have 
been formed. 

There is hardly a branch of research on corals and coral reefs in which Gardiner’s 
work is not of great importance. It was his observations on the Funafuti Atoll 
and the Atolls of the Maldive and Laccadive Archipelagoes that caused him to
realize that no one theory, such as the ‘subsidence’ theory of Darwin, or the
‘solution’ theory of Murray, can account for the formation of all such reefs and
atolls though, when once formed, every reef has been moulded and modified 
by world-wide phenomena, such as a change in the relative levels of sea and 
land. 

Stanley Gardiner has given us an admirable summary of this, his life’s work, 
and of the conclusions that he has drawn from it, in his book Coral Reefs and Atolls, 
a most valuable supplement to Darwin’s own volume On the structure and distribution
of coral reefs. There is a special fitness in the award of the Darwin Medal
for work of such ‘acknowledged distinction in the field in which Charles Darwin
himself laboured’.




The Hughes Medal is awarded to Professor George Ingle Finch in recognition
of his fundamental contributions to the study of the structure and properties of 
surfaces; and for his important work on the electrical ignition of gases. 

Finch has carried out two important bodies of research in different fields, both 
involving electrical considerations in a fundamental manner. The first was a 
detailed study of the electrical ignition of gases, the second the application of 
electron diffraction to a wide range of chemical and physical surface problems. 

In his work on electrical ignition, he not only elucidated the chemistry of the 
ignition of simple gaseous systems, but was the first to develop the theory of the 
sparking ignition coil. His inductance component control interrupter has been 
used by the Radio Research Board for the production of single electromagnetic 
pulses. In the course of his ignition work Finch developed, as a pioneer in this 
country, the high speed cathode-ray oscillograph. 

In the field of electron diffraction Finch has developed the electron diffraction 
camera into an equipment giving results of high accuracy with speed and ease 
of manipulation. The Finch camera has found wide application outside his laboratory 
and examples made under his direction have been installed, among other places, 
at the National Physical Laboratory, University College, London, the University 
of Brussels (two), and in the laboratories of Messrs Ferranti and of other industrial 
research centres. The pictures which he has obtained with it are outstanding in 
beauty of detail. He has contributed notably to the interpretation of the electron 
diffraction pattern and has applied his methods to many problems of theoretical 
and practical importance. 

Of special interest are his studies of the relation between crystal size and lattice 
dimensions and, in the more practical field, his investigations into the effect of 
the substrate on adhesion of electro deposits, into the nature of polish and into 
the mechanism of boundary lubrication and the wear of sliding surfaces. In all 
these he has materially advanced our knowledge, and his work on sliding surfaces, 
in particular, has found important applications in engineering practice. 

His work during the war has covered a variety of fields, some involving the 
application of electron diffraction. That which he has carried out as scientific
adviser to the Ministry of Home Security, while less closely related to his normal 
lines of research, has been of the greatest value. 



Relaxation methods applied to engineering problems 

XI. Problems governed by the ‘quasi-plane-potential equation’

By D. N. de G. Allen, R. V. Southwell, F.R.S. and Gillian Vaisey


(Received 24 March 1944) 


The ‘quasi-plane-potential equation’

\[
\frac{\partial}{\partial x}\left(\chi\frac{\partial\psi}{\partial x}\right)
+\frac{\partial}{\partial y}\left(\chi\frac{\partial\psi}{\partial y}\right)+Z=0
\]

is discussed in relation to problems where $Z$ and $\chi$ are known functions of $x$ and $y$. It governs
(inter alia) the small transverse displacement of a membrane in which the tension $T\propto\chi$, and
its finite-difference approximation governs the small transverse displacements of nodal
points of a net in which, similarly, the string tension $\mathsf{T}$ varies from node to node.
(Equilibrium in the directions of $x$ and $y$ can be maintained, both in the membrane and in the net,
by forces acting in those directions and accordingly having no effect on the transverse
equilibrium.)

The relaxational treatment, based on this mechanical (net) analogue, reduces when $\chi$ is
constant to the treatment developed, in earlier papers of this series, for problems governed
by the ‘plane-harmonic’ (Poisson) equation

\[
\frac{\partial^2\psi}{\partial x^2}+\frac{\partial^2\psi}{\partial y^2}+Z=0.
\]

Thus an opportunity is afforded for a review of various improvements which have been
incorporated in the technique first propounded in Part III. In particular, this paper for the
first time uses systematically the device of ‘graded nets’, i.e. nets of which the mesh-side $a$ is
smaller in some parts than in others.


Introduction 

1. Relaxation methods as developed in this series are direct extensions of a
method first suggested for the calculation of stresses in frameworks, ‘the method
of systematic relaxation of constraints’ (Southwell 1935). Part I brought differential
equations within their scope. Part III first dealt with equations involving two
independent variables. Part VI developed a technique for ‘characteristic value’
(Eigenwerte) problems.

Part VII and papers subsequent thereto* have all been concerned with equations
in two variables. Like Part III (which dealt with the equations of Laplace and of
Poisson) they begin by substituting for the exact differential equation its approximation
in finite differences, thereby obtaining (in effect) a system of equations relating
values of the wanted function at nodal points of a regular lattice or ‘net’; then
they deal with those equations by a relaxational technique. Successive papers have
treated governing equations of increasing order and complexity, or boundary conditions
of special types; but from time to time they have recorded improvements
of technique which have general application, though suggested by special problems.

* Some of these papers have been adjudged to be ‘secret’ during the war period, and
accordingly are not included in the list of references. 



It was inevitable that relaxation methods should thus develop, but a consequence
is that no paper in the series gives an exact account of those methods as now employed;
Part III in some respects is superseded, and later papers are concerned with
particular applications. In this paper, therefore, we review the present state of
relaxational technique as a preliminary to particular study of the equation

\[
\frac{\partial}{\partial x}\left(\chi\frac{\partial\psi}{\partial x}\right)
+\frac{\partial}{\partial y}\left(\chi\frac{\partial\psi}{\partial y}\right)+Z=0
\tag{1}
\]

in relation to problems where $\chi$ and $Z$ are known functions of $x$ and $y$.* When $\chi$
is constant, (1) reduces to the ‘plane-harmonic equation’ of Parts III, V, VII and
VIII, and on this account we have ventured to describe its form as ‘quasi-plane-harmonic’.

Basic theory 

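2. When $Z$ is zero, (1) reduces to

\[
\frac{\partial}{\partial x}\left(\chi\frac{\partial\psi}{\partial x}\right)
+\frac{\partial}{\partial y}\left(\chi\frac{\partial\psi}{\partial y}\right)=0,
\tag{2}
\]

and the existence of a solution $\psi$ implies that

\[
\chi\frac{\partial\psi}{\partial y}=\frac{\partial\phi}{\partial x},\qquad
\chi\frac{\partial\psi}{\partial x}=-\frac{\partial\phi}{\partial y},
\tag{3}
\]

and therefore implies the existence of a solution $\phi$ to the equation

\[
\frac{\partial}{\partial x}\left(\frac{1}{\chi}\frac{\partial\phi}{\partial x}\right)
+\frac{\partial}{\partial y}\left(\frac{1}{\chi}\frac{\partial\phi}{\partial y}\right)=0,
\tag{4}
\]

which results when $\psi$ is eliminated from (3). When $\chi$ is constant it can be cancelled
out from (2) and (4), and then both $\psi$ and $\phi$ (its ‘conjugate’) are plane-harmonic
functions: this is a familiar result. In general the associated functions $\psi$ and $\phi$
satisfy similar but not identical equations, described as ‘adjoint’.

When $s$ and $\nu$, measured along and perpendicular to a closed contour or boundary,
have the senses shown in figure 1, then

\[
\frac{\partial}{\partial x}=\cos(x,\nu)\frac{\partial}{\partial\nu}-\sin(x,\nu)\frac{\partial}{\partial s},\qquad
\frac{\partial}{\partial y}=\sin(x,\nu)\frac{\partial}{\partial\nu}+\cos(x,\nu)\frac{\partial}{\partial s},
\tag{5}
\]

and hence, when (3) are satisfied,

\[
\chi\frac{\partial\psi}{\partial s}=\frac{\partial\phi}{\partial\nu},\qquad
\chi\frac{\partial\psi}{\partial\nu}=-\frac{\partial\phi}{\partial s}.
\tag{6}
\]

Figure 1

This means that $\partial\psi/\partial\nu = 0$ at every point of a $\phi$-contour ($\phi$ = const.), $\partial\phi/\partial\nu = 0$ at
every point of a $\psi$-contour ($\psi$ = const.). These too are generalizations of familiar
properties of conjugate plane-harmonic functions.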

* Part IX discussed an equation of this form ($Z$ being zero); but there $\chi$ was related with
$\psi$ by a second equation (the ‘equation of state’). Cf. also Christopherson (1941): ‘Relaxation
methods applied to problems of film lubrication.’



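3. When $Z \neq 0$ in (1), an adjoint equation can still be formulated. Let $Q$ be a
function such that

\[
\frac{\partial^2 Q}{\partial x^2}+\frac{\partial^2 Q}{\partial y^2}=Z(x,y).
\tag{7}
\]

Then (1), written in the form

\[
\frac{\partial}{\partial x}\left(\chi\frac{\partial\psi}{\partial x}+\frac{\partial Q}{\partial x}\right)
+\frac{\partial}{\partial y}\left(\chi\frac{\partial\psi}{\partial y}+\frac{\partial Q}{\partial y}\right)=0,
\]

shows that

\[
\chi\frac{\partial\psi}{\partial y}+\frac{\partial Q}{\partial y}=\frac{\partial\phi}{\partial x},\qquad
-\left(\chi\frac{\partial\psi}{\partial x}+\frac{\partial Q}{\partial x}\right)=\frac{\partial\phi}{\partial y},
\tag{8}
\]

and from these relations the equation

\[
\frac{\partial}{\partial x}\left(\frac{1}{\chi}\frac{\partial\phi}{\partial x}\right)
+\frac{\partial}{\partial y}\left(\frac{1}{\chi}\frac{\partial\phi}{\partial y}\right)
+\frac{1}{\chi^2}\left(\frac{\partial\chi}{\partial x}\frac{\partial Q}{\partial y}
-\frac{\partial\chi}{\partial y}\frac{\partial Q}{\partial x}\right)=0
\tag{9}
\]

is obtained by elimination of $\psi$. Since $\chi$ is specified and $Q$ is deducible from (7), the
last term on the left of (9) may be regarded as a known function of $x$ and $y$,
corresponding with $Z$ in (1).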


Some physical examples 

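4. The quasi-plane-harmonic equation occurs in many branches of mathematical
physics. In electricity, for example, it governs the flow of current, and the electric
potential, in a plane conducting sheet. Let $i_x$, $i_y$ denote line-intensities of current
in the directions $Ox$, $Oy$, and suppose that both surfaces of the sheet are insulated
so that no current, on the whole, can enter or leave an element. This supposition
may be stated in the equation

\[
\frac{\partial i_x}{\partial x}+\frac{\partial i_y}{\partial y}=0,
\tag{i}
\]

which shows that $i_x$, $i_y$ can be expressed in terms of a single ‘current function’ $\psi$,
thus:

\[
i_x=\frac{\partial\psi}{\partial y},\qquad i_y=-\frac{\partial\psi}{\partial x}.
\tag{ii}
\]

Let $\rho$ denote the ‘resistivity’ of the conducting sheet, $V$ the electric potential at the
point $x$, $y$. Then, by Ohm’s law,

\[
-\frac{\partial V}{\partial x}=\rho i_x=\rho\frac{\partial\psi}{\partial y},\qquad
-\frac{\partial V}{\partial y}=\rho i_y=-\rho\frac{\partial\psi}{\partial x},
\tag{iii}
\]

and elimination of $V$ and $\psi$, in turn, from these equations leads to

\[
\frac{\partial}{\partial x}\left(\rho\frac{\partial\psi}{\partial x}\right)
+\frac{\partial}{\partial y}\left(\rho\frac{\partial\psi}{\partial y}\right)=0
\]

and to

\[
\frac{\partial}{\partial x}\left(\frac{1}{\rho}\frac{\partial V}{\partial x}\right)
+\frac{\partial}{\partial y}\left(\frac{1}{\rho}\frac{\partial V}{\partial y}\right)=0.
\tag{10}
\]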




Equations (10) are analogous with (2) and (4), for which, accordingly, they
suggest a physical interpretation: The function $\chi$, in (2), is a measure of some kind
of ‘resistance’ ($\rho$) or ‘admittance’ ($\rho^{-1}$) which varies from point to point of the field.
When it measures a resistance, the wanted function measures something having
the nature of a flux; when an admittance, something having the nature of potential.
Similarly in the theory of magnetism, if $Q$ denotes the ‘magnetic potential’, $\mu$ the
‘permeability’ of the material at any point $(x, y, z)$ in a magnetic field, when everything
is independent of $z$ (so that $\partial\mu/\partial z = \partial Q/\partial z = 0$), then $Q$ satisfies the equation

\[
\frac{\partial}{\partial x}\left(\mu\frac{\partial Q}{\partial x}\right)
+\frac{\partial}{\partial y}\left(\mu\frac{\partial Q}{\partial y}\right)=0,
\tag{11}
\]

$\mu$ being (in general) a specified function of $x$ and $y$. With (11) is associated the
adjoint equation

\[
\frac{\partial}{\partial x}\left(\frac{1}{\mu}\frac{\partial\psi}{\partial x}\right)
+\frac{\partial}{\partial y}\left(\frac{1}{\mu}\frac{\partial\psi}{\partial y}\right)=0,
\tag{12}
\]

which governs the ‘magnetic flux function’ $\psi$; and (12) and (11) reduce respectively
to (2) and (4) when $\mu^{-1}$ (a quantity having the nature of resistance) is replaced by
$\chi$ and $Q$ (a potential function) by $\phi$.

5. In systems characterized by symmetry with respect to an axis $Oz$, the
quasi-plane-potential equation appears as a special form of Laplace’s equation. Thus in
material of uniform thermal conductivity the temperature $v$, if steady, is governed by

\[
\nabla^2 v=0,
\tag{13}
\]

which in cylindrical co-ordinates $r$, $\theta$, $z$ assumes the form

\[
\left[\frac{\partial^2}{\partial r^2}+\frac{1}{r}\frac{\partial}{\partial r}
+\frac{\partial^2}{\partial z^2}\right]v=0,
\tag{14}
\]

i.e.

\[
\frac{\partial}{\partial r}\left(r\frac{\partial v}{\partial r}\right)
+\frac{\partial}{\partial z}\left(r\frac{\partial v}{\partial z}\right)=0,
\tag{15}
\]

when (on account of axial symmetry) $v$ is invariant with respect to $\theta$. Equation
(15) is identical in form with (2), $v$ and $r$ replacing $\psi$ and $\chi$.
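
As an informal numerical check of the step from (14) to (15), the following Python sketch evaluates the left-hand side of (15) by central differences. The test functions, the step h and the helper name lhs_15 are assumptions introduced here for illustration; they do not come from the paper. The function r² − 2z² satisfies (14), so the estimate for it should vanish to within rounding error, whereas r² + z² does not satisfy (14) and gives 6r.

```python
def lhs_15(v, r, z, h=1e-3):
    """Central-difference estimate of d/dr(r dv/dr) + d/dz(r dv/dz)."""
    # "Flux" r * dv/dr taken half a step either side of r.
    flux_r_plus = (r + h / 2) * (v(r + h, z) - v(r, z)) / h
    flux_r_minus = (r - h / 2) * (v(r, z) - v(r - h, z)) / h
    # "Flux" r * dv/dz taken half a step either side of z (r is fixed here).
    flux_z_plus = r * (v(r, z + h) - v(r, z)) / h
    flux_z_minus = r * (v(r, z) - v(r, z - h)) / h
    return (flux_r_plus - flux_r_minus) / h + (flux_z_plus - flux_z_minus) / h


def harmonic(r, z):
    # Axially symmetric solution of Laplace's equation.
    return r * r - 2.0 * z * z


def not_harmonic(r, z):
    # Counter-example: its Laplacian is 6, so (15) gives 6 r.
    return r * r + z * z


value_harmonic = lhs_15(harmonic, 1.3, 0.4)
value_other = lhs_15(not_harmonic, 1.3, 0.4)
```

The half-step placement of the factor r mirrors the way the coefficient is sampled at mid-points of strings in the finite-difference treatment later in the paper.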
6. A last example may be taken from the theory of torsional stresses in a bar of
circular section but non-uniform diameter (Southwell 1942, §§ 8 and 11). On certain
assumptions (akin to those made in Saint Venant’s theory of torsion for prismatic
bars of non-circular section), these consist solely of shear stress on all axial $(z, r)$
planes, having components

\[
\widehat{\theta z}=-\frac{1}{r^2}\frac{\partial\phi}{\partial r},\qquad
\widehat{r\theta}=\frac{1}{r^2}\frac{\partial\phi}{\partial z},
\tag{16}
\]

where $\phi$ is a function of $r$ and $z$, satisfying the condition

\[
\phi=\text{const.}
\tag{17}
\]

along a generator of the (stress-free) boundary surface, and the equation

\[
\frac{\partial}{\partial r}\left(\frac{1}{r^3}\frac{\partial\phi}{\partial r}\right)
+\frac{\partial}{\partial z}\left(\frac{1}{r^3}\frac{\partial\phi}{\partial z}\right)=0
\tag{18}
\]

at every point in the material. Equation (18) is (2) with

\[
\chi=r^{-3},\qquad \psi=\phi.
\tag{19}
\]

Membrane analogue of the quasi-plane-potential equation 


7. A ‘membrane analogue’ of (1) can be stated. Suppose that the membrane
tension $T$ has an intensity, $\mathbf{T}\chi$, which varies from point to point, and let $\psi$ denote
the (small) transverse displacement: then it is easy to obtain (1) as the condition
of transverse equilibrium for an element at $(x, y)$ under external transverse loading
of surface intensity $\mathbf{T}Z$. There is no difficulty in the notion of a variable tension,
since we can suppose this to be maintained by forces acting always in directions
parallel to the plane of the undeflected membrane, and as such having no influence
on the transverse equilibrium. The surface intensity of these forces must have
components

\[
-\mathbf{T}\frac{\partial\chi}{\partial x},\qquad -\mathbf{T}\frac{\partial\chi}{\partial y},
\tag{20}
\]

so they will be derivable from a potential $\mathbf{T}\chi$.


The finite-difference approximation, and the ‘relaxation pattern’


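8. A relaxational treatment of (1), which includes (2) as a special case, presents
no difficulty. It is assumed that the equation is in ‘non-dimensional’ form (so that
$x$, $y$, $\chi$ and $\psi$ are purely numerical quantities), and as a first step its
finite-difference approximation is formulated.

Referring to figure 2, we have the usual approximations:

\[
a\left(\frac{\partial\psi}{\partial x}\right)_{\mathrm{I}}\approx\psi_1-\psi_0,\qquad
a\left(\frac{\partial\psi}{\partial x}\right)_{\mathrm{III}}\approx\psi_0-\psi_3,
\tag{i}
\]

therefore (similarly)

\[
a^2\left[\frac{\partial}{\partial x}\left(\chi\frac{\partial\psi}{\partial x}\right)\right]_0
\approx\chi_{\mathrm{I}}(\psi_1-\psi_0)-\chi_{\mathrm{III}}(\psi_0-\psi_3).
\tag{ii}
\]

By a like argument we have

\[
a^2\left[\frac{\partial}{\partial y}\left(\chi\frac{\partial\psi}{\partial y}\right)\right]_0
\approx\chi_{\mathrm{II}}(\psi_2-\psi_0)-\chi_{\mathrm{IV}}(\psi_0-\psi_4),
\tag{iii}
\]

and hence, as the required approximation to (1),

\[
\mathbf{F}_0=F_0+\mathcal{F}_0=0,\quad\text{where}\quad
F_0=a^2Z_0,\qquad
\mathcal{F}_0=\Sigma_{a,4}(\chi_{\mathrm{I}}\psi_1)-\psi_0\,\Sigma_{a,4}(\chi_{\mathrm{I}}).
\tag{21}
\]

In these equations I, II, III, IV (for which $\chi$-values must be known) are points
half-way between 0 and 1, 2, 3, 4 respectively. It is an easy matter to deduce from
(21) the ‘relaxation pattern’ corresponding with an isolated displacement $\Delta\psi_0 = 1$.
This is shown in figure 3a: it is very simple, and it reduces (as it should) to the
standard plane-harmonic pattern when $\chi = 1$.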
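
The residual of (21) and the relaxation pattern it implies can be sketched in a few lines of Python. The function names and argument layout below are hypothetical, chosen only for illustration; the neighbours are listed in the order 1, 2, 3, 4 and the mid-point values of χ in the order I, II, III, IV. The pattern follows from (21) directly: a unit displacement of node 0 lowers the residual at 0 by the sum of the four mid-point values and raises the residual at each neighbour by the value of χ on the connecting string.

```python
def residual_21(psi0, psi_neighbours, chi_mid, a, z0):
    """Residual of (21) at node 0: a^2 Z_0 + sum of chi_I * (psi_1 - psi_0)."""
    initial = a * a * z0
    from_displacements = sum(
        c * (p - psi0) for c, p in zip(chi_mid, psi_neighbours)
    )
    return initial + from_displacements


def relaxation_pattern(chi_mid):
    """Residual changes caused by the isolated displacement delta psi_0 = 1."""
    change_at_node_0 = -sum(chi_mid)
    change_at_neighbours = list(chi_mid)
    return change_at_node_0, change_at_neighbours


# With chi = 1 on every string the standard plane-harmonic pattern is recovered.
uniform = relaxation_pattern([1, 1, 1, 1])
# A case in which chi differs from string to string.
varied = residual_21(0.0, [1.0, 0.0, -1.0, 0.0], [1.5, 1.0, 0.5, 1.0], 0.1, 0.0)
```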


Figure 2

Figure 3


9. Alternatively, since the quantity

\[
2\left[\frac{\partial}{\partial x}\left(\chi\frac{\partial\psi}{\partial x}\right)
+\frac{\partial}{\partial y}\left(\chi\frac{\partial\psi}{\partial y}\right)\right]
=2\chi\nabla^2\psi+\nabla^2(\chi\psi)-\chi\nabla^2\psi-\psi\nabla^2\chi
=\nabla^2(\chi\psi)+\chi\nabla^2\psi-\psi\nabla^2\chi,
\tag{i}
\]

therefore using the standard (square-net) approximation

\[
a^2(\nabla^2 w)_0\approx\Sigma_{a,4}(w)-4w_0
\tag{ii}
\]

in relation to the terms of the right of (i), we have

\[
2\mathcal{F}_0=\Sigma_{a,4}(\chi\psi)-4\chi_0\psi_0
+\chi_0\{\Sigma_{a,4}(\psi)-4\psi_0\}-\psi_0\{\Sigma_{a,4}(\chi)-4\chi_0\}
=\Sigma_{a,4}[\psi_1(\chi_1+\chi_0)]-\psi_0\,\Sigma_{a,4}(\chi_1+\chi_0)
\tag{22}
\]

in replacement of the last of (21). The corresponding pattern is as shown in
figure 3b, reducing to identity with figure 3a when

\[
2(\chi_{\mathrm{I}},\chi_{\mathrm{II}},\chi_{\mathrm{III}},\chi_{\mathrm{IV}})
\ \text{is identified with}\ \chi_0+(\chi_1,\chi_2,\chi_3,\chi_4),
\]

and to identity with the standard plane-harmonic pattern when $\chi = 1$.

Within the approximation of our finite-difference expressions for $\partial\psi/\partial x$, etc.,
figures 3a and 3b are equivalent: that is to say, their differences are of a fractional
order which is here neglected. An advantage of the treatment leading to (22) is
that it can be applied as well to triangular nets ($N = 6$) if (ii) is replaced by its
generalization

\[
\frac{N}{4}a^2(\nabla^2 w)_0\approx\Sigma_{a,N}(w)-Nw_0,
\tag{iii}
\]

in which (cf. Part III, § 8) terms of order $a^4$ are suppressed. Substituting from (i)
and (iii) in (1), we obtain the first of (21) with expressions as under for $F_0$, $\mathcal{F}_0$:

\[
F_0=\frac{N}{4}a^2Z_0,\qquad
2\mathcal{F}_0=\Sigma_{a,N}[\psi_1(\chi_1+\chi_0)]-\psi_0\,\Sigma_{a,N}(\chi_1+\chi_0).
\tag{23}
\]

The second of these reduces to (22) when $N = 4$; and with an error of order $(a^4)$
which is here neglected it can be replaced by

\[
\mathcal{F}_0=\Sigma_{a,N}(\chi_{\mathrm{I}}\psi_1)-\psi_0\,\Sigma_{a,N}(\chi_{\mathrm{I}}),
\tag{24}
\]

which, when $N = 4$, reduces similarly to the last of (21). Diagrams similar to figures
3a, b, are easily constructed to give, for $N = 6$, the relaxation patterns corresponding
with (24) and (23) respectively.
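
The statement that the two forms differ only by terms of a neglected order can be illustrated numerically for a square net. In the Python sketch below the particular functions chosen for χ and ψ, the sample point and the mesh lengths are arbitrary assumptions made for the illustration, not data from the paper. It evaluates the mid-point form (the last of (21)) and the nodal form (22) at one node and compares their difference for two mesh lengths; if the difference is of order a⁴, halving a should reduce it by a factor of roughly sixteen.

```python
import math

# Neighbours 1, 2, 3, 4 of node 0 on a square net, as unit offsets.
STAR = ((1, 0), (0, 1), (-1, 0), (0, -1))


def chi(x, y):
    # Assumed smooth, positive coefficient (illustrative only).
    return math.exp(x + 0.5 * y)


def psi(x, y):
    # Assumed smooth trial function (illustrative only).
    return math.sin(x) * math.cos(y)


def force_midpoint(x, y, a):
    """Last of (21): chi is sampled at the mid-points I, II, III, IV."""
    return sum(
        chi(x + dx * a / 2, y + dy * a / 2)
        * (psi(x + dx * a, y + dy * a) - psi(x, y))
        for dx, dy in STAR
    )


def force_nodal(x, y, a):
    """Equation (22): chi is sampled at the nodes 0, 1, 2, 3, 4 only."""
    return 0.5 * sum(
        (chi(x + dx * a, y + dy * a) + chi(x, y))
        * (psi(x + dx * a, y + dy * a) - psi(x, y))
        for dx, dy in STAR
    )


def gap(a, x=0.3, y=0.2):
    return abs(force_midpoint(x, y, a) - force_nodal(x, y, a))


ratio = gap(0.1) / gap(0.05)
```

The nodal form has the practical merit that χ need be tabulated only at the nodes themselves.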


Net analogue of the finite-difference approximation 

10. The results of §§ 8-9 can be interpreted mechanically, as relating to a tensioned
net in which the string tensions (like the membrane tension in § 7) have different
values in different parts; equilibrium in the plane of the undeflected net being
maintained (as in § 7) by an application of suitable forces which are without effect on the
transverse equilibrium.

What follows is a treatment similar to that of Part VIII, §§ 6-10, but modified
in that now the membrane tension is given by

\[
T=\mathbf{T}\chi,
\tag{25}
\]

$\mathbf{T}$ being constant but $\chi$ a specified function of $x$ and $y$ (§ 7). When $N = 6$ the loading
$\mathbf{T}Z$ is concentrated at nodal points according to the rules of statics; but when $N = 3$
or 4 it is best to concentrate at 0 (figure 4) the whole of the pressure acting on the
surrounding polygon $abc \ldots a$. Then, for $N = 6$, the initial force at 0 is given by

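\[
F_0=\tfrac{1}{4}N\mathbf{T}\tan\frac{\pi}{N}\left[a^2Z_0+\frac{a^4}{16}(\nabla^2Z)_0\right];
\tag{i}
\]

when $N = 3$ or 4 it is given by

\[
F_0=\tfrac{1}{4}N\mathbf{T}\tan\frac{\pi}{N}\,a^2Z_0\quad\text{simply.}
\tag{ii}
\]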








Figure 4



As in Part VIII, § 8, we concentrate in a string OB of the substituted net (figure 5)
the whole force which is the membrane tension $T$ acting on the length $kl = a\tan\pi/N$.
Then, with neglect of quantities of order $a^4$,

\[
\mathsf{T}=\mathbf{T}a\tan\frac{\pi}{N}\,\chi_{\mathrm{I}},
\tag{iii}
\]

I denoting (as before) the middle point of the string: so we have the expression

\[
\mathsf{T}\Delta\psi/a=\mathbf{T}\tan\frac{\pi}{N}\,\chi_{\mathrm{I}}\Delta\psi
\tag{iv}
\]

for the force exerted by a string in consequence of a relative transverse displacement
$\Delta\psi$ of its two ends; and hence the expression

\[
\mathcal{F}_0=\mathbf{T}\tan\frac{\pi}{N}\,\Sigma_{a,N}[\chi_{\mathrm{I}}(\psi_1-\psi_0)]
\tag{v}
\]

for the total transverse force imposed at 0 by reason of displacements $\psi_0, \ldots, \psi_N$.

11. Accordingly the condition of transverse equilibrium is

\[
\mathbf{F}_0=F_0+\mathcal{F}_0=0,
\tag{21 bis}
\]

where $F_0$ and $\mathcal{F}_0$ are given by (i) or (ii) and (v). But this condition is unchanged if
$F_0$, $\mathcal{F}_0$ are multiplied in the same proportion, and moreover, since terms of order $a^3$
have been neglected in obtaining (iv), there is no longer any justification in retaining
the second term on the right of (i): therefore, cancelling (or equating to unity) the
factor $\mathbf{T}\tan\pi/N$, we now associate with (21) the expressions

\[
F_0=\tfrac{1}{4}Na^2Z_0,\qquad
\mathcal{F}_0=\Sigma_{a,N}[\chi_{\mathrm{I}}(\psi_1-\psi_0)],
\tag{26}
\]

which, for $N = 4$, are identical with the second and third of (21), § 8.

The ‘force per string’ must be modified accordingly: i.e. in place of (iv) we
now have

\[
\mathsf{T}\Delta\psi/a=\chi_{\mathrm{I}}\Delta\psi,
\]

and in place of (iii)

\[
\mathsf{T}=a\chi_{\mathrm{I}}.
\tag{27}
\]

Because, with neglect of terms of order $a^2$,

\[
2\chi_{\mathrm{I}}=\chi_1+\chi_0,\ \text{etc.},
\]

with neglect of terms of order $a^3$ the last of (26) is equivalent to (22), and (27) is
equivalent to

\[
2\mathsf{T}=a(\chi_0+\chi_1).
\tag{28}
\]

Cancellation of the factor $\mathbf{T}\tan\pi/N$ is equivalent to the assumption that

\[
\mathbf{T}=\cot\pi/N.
\tag{29}
\]

This compares with the assumption (14) of Part VIII, § 9, where, in consequence,
the string tension $\mathsf{T}$ had a uniform value $a$, with the result that unit force was
transmitted by any string by reason of a unit difference in the displacements of its
two ends. (The same result is obtained by making $\chi_0$ and $\chi_1$ unity in (28). In the
original treatment of Part III, § 18, the ‘force per string’ was made less in the ratio
$1/N$, and a similar reduction was made in the expression for $F_0$. The new convention
has been found to have practical advantages.)
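
How the residual forces of (21) and (26) drive a relaxation process can be sketched as follows. The Python below is a minimal illustration under stated assumptions and not the authors’ working procedure, which relies on hand computation with block and group displacements: it treats a square net (N = 4) on the unit square with the wanted function specified on the boundary, and at each step it liquidates the largest residual by a single point displacement. The function name relax, its parameters and the test problem are all hypothetical. The test problem takes χ = 1 + x and Z = −1, for which ψ = x satisfies (1) and, as may be checked from (21), also satisfies the finite-difference equations exactly.

```python
def relax(chi, load, n, boundary, tol=1e-10, max_steps=100_000):
    """Point relaxation of (21) on an n-by-n square net over the unit square.

    chi, load and boundary are functions of (x, y); load plays the part of Z.
    """
    a = 1.0 / n
    psi = [[0.0] * (n + 1) for _ in range(n + 1)]
    for i in range(n + 1):
        for j in range(n + 1):
            if i in (0, n) or j in (0, n):
                psi[i][j] = boundary(i * a, j * a)

    def strings(i, j):
        # Neighbouring nodes, each with chi taken at the mid-point of the string.
        return [
            (i + di, j + dj, chi((i + di / 2) * a, (j + dj / 2) * a))
            for di, dj in ((1, 0), (0, 1), (-1, 0), (0, -1))
        ]

    def residual(i, j):
        # F_0 = a^2 Z_0 plus the force from displacements, as in (21).
        return a * a * load(i * a, j * a) + sum(
            c * (psi[p][q] - psi[i][j]) for p, q, c in strings(i, j)
        )

    interior = [(i, j) for i in range(1, n) for j in range(1, n)]
    for _ in range(max_steps):
        # Pick the node carrying the largest residual force.
        i, j = max(interior, key=lambda node: abs(residual(*node)))
        r = residual(i, j)
        if abs(r) < tol:
            break
        # The displacement that brings the residual at this node to zero.
        psi[i][j] += r / sum(c for _, _, c in strings(i, j))
    return psi


solution = relax(lambda x, y: 1.0 + x, lambda x, y: -1.0, 4, lambda x, y: x)
```

Recomputing every residual at each step is wasteful but keeps the sketch short; a hand computer would instead update only the residuals touched by the relaxation pattern.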





‘Irregular stars’, and the use of ‘fictitious nodes’

12. To summarize: equation (21) with $F_0$, $\mathcal{F}_0$ given by (23) or (26), which are
equivalent, may be used in accordance with the net analogy to solve the
quasi-plane-potential equation. The corresponding ‘patterns’ are shown in figures 3a, b,
§ 8, and may be used as in previous papers of the series to deduce ‘block’ and ‘group
displacements’. The technique of ‘liquidation’ need not be described again.

Figures 3 express the assumption that the tension in any string is

\[
\mathsf{T}=a\chi_{\mathrm{I}}
\tag{27 bis}
\]

\[
\phantom{\mathsf{T}}=\tfrac{1}{2}a(\chi_0+\chi_1)
\tag{28 bis}
\]

when the third and higher powers of $a$ are neglected. When $\chi = 1$ everywhere, both
of these expressions reduce to

\[
\mathsf{T}=a,
\tag{30}
\]

as assumed in Part VIII, § 9. There, as a result, unit force was transmitted by any
one string in consequence of a unit difference in the displacements of its two ends.
According to (27) and (28) the force so transmitted is $\chi_{\mathrm{I}} = \tfrac{1}{2}(\chi_0+\chi_1)$.

13. No difficulty is presented by ‘ irregular stars ’ (i.e. by nodal points from which 
one or more strings radiate which are not of the standard length a), provided that 
we accept the consequences of the mechanical (net) analogue and assume that 
every string remains straight in the deflected net. The force transmitted by any 
string is proportional to its tension and to its slope, and the latter, for the same 
difference between its end displacements, is.increased in the ratio 1 /f when its length 
is £a instead of a. Therefore the ‘ force per unit displacement ’ is increased to 

|Xi - ^(Xo+Xi)> ( 31 ) 

or to 1 /£ when % = 1 as in Part III, 0, 1,1 denoting as before the ends and middle 
point of the string in question. 

In cases where the wanted function has specified boundary values it has become 
customary to employ the notion of ‘ fictitious points 5 lying outside the boundary and 
connected with internal nodes by strings of the standard mesh-length a. Then, if 
unit displacement is imposed upon the internal node while boundary points are held 
fixed , to keep the string straight a displacement — (1 — £) a[£, must be imposed at the 
same time on the fictitious node, denoting as before the length of string inside the 
boundary. This means that a differential displacement aj£ is imposed upon the two 
ends of a string of standard length a, and the expressions (31) result. 

‘Fictitious nodes’ have been found useful in dealing with problems involving
refraction, where a string on crossing an interface between two different media
has to have its gradient multiplied in a specified ratio. This problem was discussed
in Part III, § 34, with a result which in our present notation may be stated as follows:
The discontinuous change of medium implies (according to the ‘net analogue’) a



268 D. N. de G. Allen, R. V. Southwell and G. Vaisey

discontinuous change of string tension from $T_1$ (say) to $T_2$. Let $\alpha_1 a$ be the length of
string which has tension $T_1$, $\alpha_2 a$ the length which has tension $T_2$ (so that $\alpha_1 + \alpha_2 = 1$).
Then the effective length of string in medium 1 is $\xi_1 a$ and the effective length of
string in medium 2 is $\xi_2 a$, where

$$\xi_1 = \alpha_1 + \alpha_2\frac{T_1}{T_2}, \qquad \xi_2 = \alpha_2 + \alpha_1\frac{T_2}{T_1}. \qquad (32)$$

These formulae are to be combined with (31).
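The notion of ‘effective length’ can be checked by treating the string as two straight segments in series which carry the same transverse force. The sketch below rests on that assumption only; the function name and the notion of a ‘compliance’ are illustrative and are not taken from Part III.

```python
def effective_lengths(alpha_1, t_1, t_2):
    """Effective lengths (as fractions of a) of a string crossing an interface.

    A length alpha_1*a carries tension t_1 and the remainder carries t_2.
    With no load at the interface the transverse force F is the same in both
    segments, so the end-to-end difference of displacement is
        F * a * (alpha_1/t_1 + alpha_2/t_2).
    A uniform string of tension t_k gives the same force if its length is
    xi_k * a, where xi_k = t_k * (alpha_1/t_1 + alpha_2/t_2).
    """
    if not 0.0 <= alpha_1 <= 1.0:
        raise ValueError("alpha_1 must lie between 0 and 1")
    alpha_2 = 1.0 - alpha_1
    compliance = alpha_1 / t_1 + alpha_2 / t_2
    return t_1 * compliance, t_2 * compliance
```

When the two tensions are equal both effective lengths reduce to the standard length, as they should.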


Use of ‘graded nets’: (1) when boundary values are specified

14. As liquidation approaches finality, more accurate but less simple formulae 
can be used in place of (31) and (32), which have been derived on the understanding 
(cf. § 13) that strings remain straight. But greater labour is entailed by this
procedure, and in general it would seem better to retain the simpler formulae and to
rely for close approximation on the fineness of the ultimate net. 

The use of ‘successive nets’ has been exemplified in several papers in this series, 
but there is an alternative which has not, so far, been used systematically: The 
accuracy of our finite-difference approximation depends upon the magnitudes of 
the neglected differentials, i.e. upon the rapidity with which the wanted function 
varies; consequently a finer net is needed in some parts of a ‘field’ than in others, to
define the function with precision. In hydrodynamic problems, for example, a fine
net is required in regions where the velocities are large, but to extend this into regions
of low velocity would entail great labour and would bring no compensating
advantage. What is wanted is a technique involving specially fine nets in regions
arbitrarily delimited by the computer. This (a new development) we now explain.

Example 1 

15. It may be illustrated by a problem which entails only one boundary condition:
the torsion problem for a rectangular bar having two symmetrical keyways
(figure 6). The ‘stress-function’ $\Psi$ must satisfy the equation

$$\nabla^2\Psi + 2 = 0 \qquad (33)$$

at every point in the section, and it must have a constant value (which without 
loss of generality may be taken as zero) at every point on the boundary (cf. Part III, 
§ 19). It is known that it will have, close to the keyways, gradients which are large 
compared with its gradients elsewhere; so a coarse net will suffice over most of the 
section, compared with the net which is appropriate there. On account of
symmetry, only one-quarter of the section needs to be reproduced in figure 6.
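For orientation, the liquidation of residuals for an equation of the form (33) can be imitated on a uniform square net by a simple sweep. What follows is a sketch under stated assumptions, not the graded-net procedure of this paper: the section is a plain rectangle without keyways, the net is uniform (16 by 12 meshes, matching the 4:3 rectangle), and a Gauss-Seidel sweep stands in for hand relaxation. All names are illustrative.

```python
def relax_torsion(nx=16, ny=12, a=1.0, sweeps=800):
    """Solve del^2 Psi + 2 = 0 with Psi = 0 on the edges of a rectangle.

    The rectangle has nx by ny square meshes of side a. Each sweep sets the
    residual at every internal node to zero in turn (Gauss-Seidel).
    """
    psi = [[0.0] * (nx + 1) for _ in range(ny + 1)]
    for _ in range(sweeps):
        for j in range(1, ny):
            for i in range(1, nx):
                psi[j][i] = 0.25 * (psi[j][i - 1] + psi[j][i + 1]
                                    + psi[j - 1][i] + psi[j + 1][i]
                                    + 2.0 * a * a)
    return psi


def max_residual(psi, a=1.0):
    """Largest remaining residual: sum of four neighbours - 4*Psi + 2*a^2."""
    ny, nx = len(psi) - 1, len(psi[0]) - 1
    return max(abs(psi[j][i - 1] + psi[j][i + 1] + psi[j - 1][i]
                   + psi[j + 1][i] - 4.0 * psi[j][i] + 2.0 * a * a)
               for j in range(1, ny) for i in range(1, nx))
```

On such a uniform net every internal node costs the same labour, which is the expense that a graded net is meant to avoid in regions where the gradients are small.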

In that diagram, outside of the region abc the mesh-side a is one-twelfth the length 
of a shorter side of the rectangle, or one-sixteenth the length of a longer side. (The 
rectangle has sides in the ratio 4:3.) Within abc, by insertion of diagonals, the meshes
have been successively reduced in size (and rotated through 45°) until, within the 



















region lmn, the mesh-side is only one-quarter of what it is outside of abc, and equal
to the radius at each inner corner of the keyway. Near to such corners, therefore,
$\Psi$-values are available at nodal points sufficiently close to permit an accurate
assessment of the $\Psi$-gradients (i.e. shear-stress components).

16. In ‘liquidating residuals’ on a net thus ‘graded’, care must be taken to
ensure that correct ‘relaxation patterns’ are employed. Thus the residual force at
h (figure 6) depends upon the $\Psi$-values at h, g, b, d, e, but the residual force at b
depends upon the $\Psi$-values at b, g, f, k, d, which are nodes of a coarser net;
consequently $\Psi_b$ influences $F_h$ but $\Psi_h$ does not influence $F_b$, and the notion (§ 12) of a
force transmitted from h to b by the ‘string’ hb, as the result of a ‘displacement’ $\Psi_h$,
must be discarded.

The ‘relaxation pattern’ for any nodal point O is a means of showing graphically
at what points ‘residuals’ are affected by a displacement of O. From what has been
said it follows that h is included in the pattern appropriate to b, but that b is not
included in the pattern appropriate to h. In figure 6, to assist an understanding of
the liquidation process, an inserted diagram shows ‘patterns’ drawn in accordance
with obvious conventions. These will not be needed by an experienced computer, who
will see at once the consequences of a given ‘point-displacement’.

17. But the fact (§ 16) that a point-displacement will not necessarily entail
transmission of force by every radiating ‘string’ means that apparently the sum of
the residual forces will not be invariant for every operation, as it is when uniform
nets are employed. The point e, for example, is ‘fictitious’ in relation to the coarsest
net of figure 6, and force transmitted to it as a consequence of displacement of g
or d is not brought into account because $F_e$ (as now defined) does not depend on $\Psi_g$
or on $\Psi_d$. This circumstance entails no difficulty, and the solution given in figure 7
was in fact obtained (by G.V.) easily and more quickly than it would have been if
the finest net had extended to every part of the computational field. $\Psi$-contours
show the direction and (by the closeness of their spacing) the magnitude of the shear
stress at every point.

Use of ‘graded nets’: (2) when boundary gradients are specified 

18. For nets having uniform mesh-side a in every part, a technique applicable
to plane-potential problems of the ‘second class’ (i.e. problems in which the boundary
condition imposes values on the normal gradient) was propounded in Part VIII,
§§ 11-13. This in effect replaced the specified membrane by one having a slightly
larger polygonal boundary, the sides consisting either of strings of a chosen net or
of lines which bisect its meshes (diagonals when these are square, medians when
they are triangular). Clearly the substituted membrane should be no larger than is
necessary, so here too advantage may be expected to accrue from a use of ‘graded
nets’ having specially fine meshes near the specified boundary.*

* The same will be true of ‘Method 2’ of Part VIII, §§ 20-23, which is applicable to ‘mixed’
boundary conditions.




Figure 7 




The technique will now be extended to ‘quasi-plane-harmonic’ problems where
(§ 7) the membrane tension T is non-uniform; the objective being to solve problems
of the ‘second class’ (normal gradients specified) with the highest accuracy that is
attainable without excessive labour. The extension is based on a new interpretation 
of the standard formulae (21), § 8, and (26), §11; and it will be found that this suggests 
a modification, to improve its accuracy, in the treatment of edge-loading which was 
propounded in Part VIII. The effect is not important when the net is fine. 


Alternative derivation of the standard formulae.

Extension to any polygonal area 

19. The new interpretation of (21) and (26) is based on the exact governing 
equation (1). Multiplied through by $\mathbf{T}$, and integrated in respect of any area in
which it applies, this yields a condition of equilibrium for the corresponding element
of loaded membrane. With a use of Green's transformation, and substituting from
(25), we obtain

$$\oint T\,\frac{\partial\psi}{\partial\nu}\,ds + \iint \mathbf{T}Z\,dx\,dy = 0, \qquad (34)$$

in which (§ 7) $\mathbf{T}Z$ denotes the intensity of transverse pressure and (§ 10) T denotes
the membrane tension at any point. The surface integral measures the resultant of 
the pressure, the contour integral measures the force exerted by the membrane 
tension. 



Estimation of the surface integral presents no difficulty, and the contour integral 
also can be evaluated for a region bounded by rectilinear elements. In figure 8, 
let KL be an element of this kind and 01 a line of length a which cuts it
perpendicularly at I and which is itself bisected at that point. Then, in the notation of the
diagram, 



(i) 



Relaxation methods applied to engineering problems 273 

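and hence for the element KL, if Ox is the direction of the outward normal $\nu$,

$$\int_K^L T\,\frac{\partial\psi}{\partial\nu}\,ds = \mathbf{T}\int_{-b}^{c}\Bigl[\chi_I\Bigl(\frac{\partial\psi}{\partial x}\Bigr)_I + y\Bigl\{\chi_I\Bigl(\frac{\partial^2\psi}{\partial x\,\partial y}\Bigr)_I + \Bigl(\frac{\partial\chi}{\partial y}\Bigr)_I\Bigl(\frac{\partial\psi}{\partial x}\Bigr)_I\Bigr\} + (\text{terms in } y^2, y^3, \text{etc.})\Bigr]dy$$

$$= \mathbf{T}\Bigl[(b+c)\,\chi_I\Bigl(\frac{\partial\psi}{\partial x}\Bigr)_I + \tfrac{1}{2}(c^2-b^2)\Bigl\{\chi_I\Bigl(\frac{\partial^2\psi}{\partial x\,\partial y}\Bigr)_I + \Bigl(\frac{\partial\chi}{\partial y}\Bigr)_I\Bigl(\frac{\partial\psi}{\partial x}\Bigr)_I\Bigr\}\Bigr], \qquad \text{(ii)}$$

when terms of order $b^3, b^4, \dots, c^3, c^4, \dots$, etc., are neglected.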


20. Three cases have practical interest. First, when c = b in figure 8, then (ii)
may be written as

$$\int_K^L T\,\frac{\partial\psi}{\partial\nu}\,ds = 2b\,\mathbf{T}\chi_I\Bigl(\frac{\partial\psi}{\partial x}\Bigr)_I, \qquad \text{(iii)}$$

with the same approximation as before; and since

$$\psi_1 - \psi_0 = a\Bigl(\frac{\partial\psi}{\partial x}\Bigr)_I + \tfrac{1}{24}a^3\Bigl(\frac{\partial^3\psi}{\partial x^3}\Bigr)_I + \dots \text{etc.}, \qquad (35)$$

it follows that with neglect (in comparison with unity) of $a^2$ and $b^2$ and of their higher
powers the right-hand side of (iii) may be replaced either by $2b\,\mathbf{T}\chi_I(\psi_1-\psi_0)/a$ or
by $b\,\mathbf{T}(\chi_0+\chi_1)(\psi_1-\psi_0)/a$. To the same approximation, the resultant of the membrane
tension across KL

$$= \int_K^L T\,dy = \mathbf{T}\int_{-b}^{b}\chi\,dy = 2b\,\mathbf{T}\chi_I, \quad \text{according to (i)},$$

$$= b\,\mathbf{T}(\chi_0+\chi_1), \quad \text{according to (35)};$$

consequently (iii) may be written in the form

$$\int T\,\frac{\partial\psi}{\partial\nu}\,ds \ \text{for the element } KL = T_1\,\frac{\psi_1-\psi_0}{a}, \qquad (36)$$

where $T_1 = 2b\,\mathbf{T}\chi_I = b\,\mathbf{T}(\chi_0+\chi_1)$,

and the line 01 has the direction of the outward normal.

Using this result we can formulate a condition of equilibrium for any polygonal
element, thereby generalizing the results of § 11. The formula (36) confirms these





in the case (contemplated in § 11) of a regular polygon having N sides of length
$a\tan(\pi/N)$: for on substituting $a\tan(\pi/N)$ for 2b in (36) we have

$$T_1 = a\,\mathbf{T}\tan\frac{\pi}{N}\,\chi_I, \qquad (37)$$

agreeing with (27) when (as in § 11) the factor $\mathbf{T}\tan(\pi/N)$ is equated to unity.
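As a small numerical illustration of the regular-polygon case, the sketch below evaluates the string tension for a cell of N sides. The function and argument names are illustrative; `t_ref` stands for the constant factor which multiplies $\chi$ to give the membrane tension, and when it is omitted the normalization of § 11 (factor times tan π/N equal to unity) is assumed.

```python
import math


def polygon_string_tension(a, n_sides, chi_mid, t_ref=None):
    """String tension for a regular N-sided cell: side length * t_ref * chi.

    a: length of the string (twice the distance from the node to each side).
    n_sides: number of sides N of the polygonal cell (4 for a square net,
             6 for the hexagonal cell of a triangular net).
    chi_mid: value of chi at the middle point of the string.
    t_ref: constant factor in the membrane tension; None applies the
           normalization t_ref * tan(pi/N) = 1.
    """
    if n_sides < 3:
        raise ValueError("a polygon needs at least three sides")
    factor = math.tan(math.pi / n_sides)
    if t_ref is None:
        t_ref = 1.0 / factor
    side = a * factor  # length of each side of the cell
    return t_ref * side * chi_mid
```

Under the normalization the result is simply a times the mid-point value of $\chi$, whatever the number of sides.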


21. The second case having practical interest is that in which c = 0 in figure 8
and the line 01 is part of an unloaded boundary. On that understanding $\partial\psi/\partial y = 0$
everywhere along 01, and accordingly $(\partial^2\psi/\partial x\,\partial y)_I$ in (ii) is zero; consequently if
c = 0 then with the same approximation as before we have in place of (ii)

$$\int T\,\frac{\partial\psi}{\partial\nu}\,ds = b\,\mathbf{T}\chi_C\Bigl(\frac{\partial\psi}{\partial x}\Bigr)_I, \quad \text{according to (i), C denoting the middle point of IK in figure 8,}$$

$$= T_{1/2}\,\frac{\psi_1-\psi_0}{a}, \quad \text{according to (35)},$$

$$\text{where } T_{1/2} = b\,\mathbf{T}\chi_C. \qquad (38)$$

The interpretation is the same as in § 20, except that now $T_{1/2}$, in place of $T_1$, is
the tension to be associated with the string 01. When $\chi = 1$ everywhere, $T_{1/2}$ according
to (38) $= \tfrac{1}{2}T_1$ according to (36), so the result given confirms the ‘half-tension’ rule
of Part VIII, § 12, in respect of a string which coincides with an element of a
polygonal boundary. But our examination has not only generalized that rule for use in
problems when $\chi$ is variable, it has also shown that the rule has less than the standard
order of approximation if the element of boundary is ‘loaded’, because in that
event (since $(\partial^2\psi/\partial x\,\partial y)_I$ in (ii) no longer vanishes) a term of order b has been neglected
(in comparison with unity) in deriving (38).
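The relation between the full tension of § 20 and the ‘half-tension’ of § 21 may be put side by side as follows. This is a minimal sketch with illustrative names; `t_ref` again denotes the constant factor in the membrane tension, `chi_mid` the value of $\chi$ at the middle point I of the string, and `chi_c` its value at C, the middle point of IK.

```python
def full_and_half_tension(b, chi_mid, chi_c, t_ref=1.0):
    """Return (full tension, half tension) for a string through I.

    Full tension: the element KL extends a distance b on either side of I,
    giving 2*b*t_ref*chi_mid. Half tension: the element extends on one side
    only (c = 0), giving b*t_ref*chi_c.
    """
    t_full = 2.0 * b * t_ref * chi_mid
    t_half = b * t_ref * chi_c
    return t_full, t_half
```

When $\chi$ is uniform the second value is exactly half the first; when $\chi$ varies along the boundary the two differ through the difference between the values at C and at I.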


22. The third case having practical importance is that in which $\partial\psi/\partial\nu = 0$ at
all points in KL (figure 8). Then $\int T\,(\partial\psi/\partial\nu)\,ds$ is zero, so no tension has to be given
to the string 01 which is associated with the element KL. In other words, no string
need be associated with an unloaded element of boundary.


Application to the problem of specified boundary gradients 

23. To see the application of these results, assume (in the first place) that A
(figure 9) is a polygonal boundary lying wholly outside of the specified boundary B.
As in Part VIII, § 3, it may be said that specified values of $\partial\psi/\partial\nu$ are specified values
of $T(\partial\psi/\partial\nu)$, so the problem presented is to compute the effects of a specified
transverse loading applied to B. This loading we apply to B regarded as a line traced on
the larger polygonal membrane, and we assume the outer boundary A to be unloaded.




Clearly the solution of the problem as thus modified will differ from the wanted 
solution (for a membrane having the shape of B) only in so far as constraint is 
exerted by the membrane lying between B and A. 

This added membrane (shaded in figure 9) may be termed the ‘selvedge’. It must
(in general) exercise some constraint, because variation of $\psi$ along B will entail
curvature in the direction of the normal to B and so the normal gradient, though
zero at all points of A, will not be zero on B; but clearly the constraint will decrease




with the selvedge width, so the purpose must be to make this small. To this end the 
results of §§21 and 22 are utilized: Because A’s edge is unloaded no string of the 
relaxation net need cross it, and where convenient A may be drawn to pass through 
two adjacent nodal points (i.e. to coincide with a string) provided that with that 
string is associated a ‘half-tension’ $T_{1/2}$ as given by (38). The extent of the relaxation
net is determined by the consideration that strings must cross the loaded boundary 
B in order that the specified loading may be transmitted to nodal points. 

24. It is here that we depart (in plane-harmonic problems) from the treatment
which was given in Part VIII. There (§7) it was remarked that if distributed loading 
is to be brought to bear on a net in such a way that all strings remain straight when 



the net is deflected, some mechanism must be envisaged for effecting the distribution: 
clearly the same is true of load which instead of being distributed over a surface is 
applied along a line as in § 23, and here an appropriate mechanism will consist of 
rigid rods conforming with the line of loading (short elements of which may be 
treated as straight) and transmitting the applied forces to their end-points on the 
strings. If as before the condition is imposed that all strings of the deflected net 
remain straight, other rigid rods will be needed to transmit these (concentrated) 
forces to nodal points; but then the deduced deflexion will have no discontinuity at 
the line of loading, so a wanted feature will be lost. Accordingly we now dispense with 
this last requirement as regards strings which are cut by B (§23): instead, we assume 
that these by deflecting transmit their loads to their (nodal) end-points. 

25. Let OBA (figure 10) be a string loaded at B with a concentrated force W,
and assume in the first place that its ends O, A (which are nodal points) do not deflect
(figure 10a). Then, in the notation of the diagram, forces

$$W_O = \frac{c}{a}\,W, \qquad W_A = \frac{b}{a}\,W \qquad (b + c = a)$$

will be required, for equilibrium, at O and A respectively; and the deflexion of B
will be

$$bcW/aT = \psi'_B, \ \text{say}, \qquad \text{(i)}$$

T denoting the tension in the string OA. Now let forces equal and opposite to $W_O$,
$W_A$ act at O and A, and let $\psi_O$, $\psi_A$ (figure 10b) be the displacements (determinable
by relaxation methods) which result from these and corresponding forces at other
points. The consequent deflexion of B will be

$$\psi''_B = (c\psi_O + b\psi_A)/a, \qquad \text{(ii)}$$

and if, finally, the solutions (i) and (ii) are superposed so as to neutralize the forces at
O and A, the resultant displacement of B will be (figure 10c)

$$\psi_B = \psi'_B + \frac{c}{a}\,\psi_O + \frac{b}{a}\,\psi_A. \qquad (39)$$

The displacements of internal nodes (typified by $\psi_O$) have been found already, and
using (39) we can compute the boundary displacements, thus completing the
solution.

Because the relation (34), § 19, is derived from the exact equation (1) and so must 
hold in respect of the whole of the given membrane (i.e. of the whole area within the 
line of loading B, § 23), by assuming no force to act upon the selvedge (between B 
and A) we shall be left with forces which sum to zero: consequently the relaxational
problem will be tractable, as was argued in Part VIII, § 11. The treatment
here described of loaded strings differs from what was described in § 12 of that 
paper, and may be expected to result in greater accuracy. 
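The treatment of a loaded string in § 25 amounts to a short calculation, sketched below. The names are illustrative, and it is assumed (as the expression for the deflexion of B suggests) that b and c denote the lengths OB and BA into which the load divides the string.

```python
def loaded_string(b, c, load, tension, psi_o, psi_a):
    """Nodal forces and boundary displacement for a string O-B-A loaded at B.

    b, c: lengths OB and BA (b + c = a, the length of the string).
    load: concentrated force W applied at B.
    tension: tension T in the string OA.
    psi_o, psi_a: nodal displacements found by relaxation.
    Returns (force carried to O, force carried to A, displacement of B).
    """
    a = b + c
    w_o = c * load / a                       # share of W carried to O
    w_a = b * load / a                       # share of W carried to A
    psi_fixed_ends = b * c * load / (a * tension)  # deflexion with O, A held
    psi_straight = (c * psi_o + b * psi_a) / a     # straight string between nodes
    return w_o, w_a, psi_fixed_ends + psi_straight
```

The two nodal forces always sum to the applied load, so that the total load on the net is unchanged by the transfer; the first part of the displacement is the kink at B, which a string constrained to remain straight could not show.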





Example 2 

26. Figure 11 presents a simple problem which will serve to exemplify these
methods and (since the exact solution is known) to test their accuracy. The membrane
(and hence the net) is plane in the exact solution, whatever be the distribution of $\chi$:
therefore for simplicity it has been assumed that $\chi = 1$ everywhere, so that the
governing equation is plane-harmonic.

Exact values of the displacement ($\psi$) are recorded in figure 11 at nodal points of
one of the (square-mesh) nets that were used in computation. The corresponding
gradients are calculable, so the problem could be treated on the understanding that
normal gradients of $\psi$ are specified along the edges AB, BC, CD of the trapezoidal
strip shown in thick line, and values of $\psi$ are specified along the base AD.



Figure 11. In calculating gradients 1 mesh-length has been taken as unity. 


27. Figures 12a, b exhibit the relaxational solution of this problem (by G. V.) on
two successive nets. Internal nodes are shown by black dots, strings which cross
the loaded boundary are drawn to fictitious nodes distinguished by open circles,
and the extent of the postulated ‘selvedge’ is indicated, i.e. the shape of the unloaded
outer boundary (A, § 23).

The correct value of $\psi$ is 1024 at all points of BC, and along BA and CD $\psi$ should
vary linearly: even in figure 12a the computed values nowhere have an error
exceeding 1.6 %, and in figure 12b the greatest error has fallen to 0.9 %. The correct
value of the vertical gradient ($\partial\psi/\partial y$) is 256 at all points on BC: computed values
of this quantity are plotted in the upper part of figure 12a, and the errors are seen
to be greatest at B and C and where BC is cut by horizontal strings. This is because
eight vertical strings meet BC for every one horizontal string: in the correct solution
the horizontal gradient ($\partial\psi/\partial x$) has a constant value along BC and so accounts for
a constant part of the uniform loading, but in the net this loading has to be resisted
by vertical strings alone, except at three points where a horizontal string
contributes. In figure 12b a similar plotting shows errors having a like explanation but
slightly reduced in magnitude on account of the finer mesh.

Example 3 

28. In Example 2 (consistently with §23) the polygonal boundary A was made 
either to coincide with or to lie outside the specified boundary B. But our aim in 
making the selvedge width small is to reduce to a minimum the additional 
constraint which it imposes, and clearly there is no objection to a ‘negative






Figure 12b. In calculating gradients 2 mesh-lengths have been taken as unity.











selvedge’ (i.e. to letting A lie inside B), provided that the consequent reduction
of constraint is small. In Example 3 (also a plane-harmonic problem) the chosen
selvedge was in places positive but in other places negative, as shown in figure 13.
Intuitively it is evident that the projecting ‘tabs’ (of ‘positive selvedge’) will
exercise little constraint; for they embrace only short lengths of the boundary,
and their outer edges are unloaded.

This problem was treated in Part VIII, §§ 14-15, with results which were recorded
in figures 9 of that paper. The specified normal gradients were chosen so as to make

$$w = \tan^{-1}(y/x) - \tfrac{1}{2}\pi \qquad (40)$$

an exact solution (cf. Part VIII, equations (18), § 14, and (22), § 15). The errors of
the relaxational solution were small, so cannot be much reduced by modifying the
technique of computation in the manner suggested here: therefore to show the
improvement we have thought it best to record the ‘residuals’ which are left,
according to the original and to the amended technique, when the exact solution (40)
is assumed for the displacements. (An exact technique would yield zero residuals,
and the more exact technique is that in which the residuals have the smaller values.
Here, both techniques leave some residuals in all parts of the field of computation,
because (as was remarked in Part VIII, § 16) the wanted function (40) does not exactly
satisfy the relation

£ a A w ) - = 9 , 

which is the finite-difference approximation to the 
governing equation.) 
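That an exact harmonic function leaves residuals on a finite net is easily verified numerically. The sketch below assumes a square net and the usual five-point relation (sum of the four neighbouring values less four times the central value); the function names and the sample point are illustrative only.

```python
import math


def w_exact(x, y):
    """The function (40): arctan(y/x) - pi/2, evaluated for x > 0."""
    return math.atan2(y, x) - 0.5 * math.pi


def five_point_residual(x, y, a):
    """Sum of w at the four neighbours of (x, y) minus four times w at (x, y)."""
    return (w_exact(x + a, y) + w_exact(x - a, y)
            + w_exact(x, y + a) + w_exact(x, y - a)
            - 4.0 * w_exact(x, y))
```

For a harmonic function the leading neglected term is of the fourth order in the mesh-side, so halving the mesh should reduce the residual about sixteen-fold; at the point (2, 1) with a = 0.5 the residual is close to 0.0024.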

Figure 13

29. In figure 13, at internal nodes which adjoin the circular boundary, numerals
give the residuals computed in accordance with this paper (above) and with
Part VIII (below). For this purpose, at internal nodes w was computed from
(40), and at the fictitious nodes it was given the values required (according to each
technique in turn) to make the residuals zero. Thereby residuals were transferred
from the fictitious to the internal nodes, and the resulting values are recorded.
Comparison reveals that a small but definite advantage is possessed by the modified
technique: the algebraic sum of the compared residuals is -20.8 according to
Part VIII, -10.2 according to the present paper; and their ‘root-mean-square’



value is 19.2 according to Part VIII, 18.8 according to the present paper.* (The
total of the applied load on the half circle was 2976.)

Example 4 

30. Our last example is concerned with thermal conduction in a system having 
axial symmetry. Its governing equation is (15) of § 5, so it is a quasi-plane-potential 



The piston of an internal combustion engine receives heat from the products of 
combustion through its upper surface and transmits this mainly to the cylinder 
walls, which are cooled: some heat is transmitted through its lower surface to the 

* The algebraic sum is the better criterion, because positive and negative values tend in 
the relaxation process to meet and cancel. 



















contents of the crank-case, but this is relatively small and here it will be neglected. 
Similarly of the heat transmitted to the cylinder walls, because oil has low
conductivity almost all goes through those surfaces which are brought most nearly
into contact: these are (1) those surfaces of each piston ring and groove which are 
pressed together by reason of the (frictional and inertial) resistance to the piston’s 
motion, (2) the outer surface of each piston ring and that part of the cylinder wall 
against which it presses. 

Thus, by simplification of the physical conditions, the idealized computational
problem shown in figure 14 is obtained. No heat is assumed to pass through any of
the surfaces (other than the heated top surface) which are drawn in thick line, and
no insulating effect is attributed to the thin oil films which separate the piston
rings from their grooves and from the cylinder walls (so that in effect continuous
metal is assumed to occupy the space shown by continuous shading). On the upper
surface of the piston (AB, figure 14) the temperature is assumed to have a constant
value $v_1 = 100$; and the cylinder wall, with those surfaces of the piston rings which
touch it, are assumed to have another constant temperature $v_0 = 0$. The problem is to
compute v for points within the metal of the piston, and thence to deduce the
associated heat-flow. Clearly the isothermals will be ‘crowded’ in the neighbourhood
of the piston rings, so a use of graded nets is indicated.

31. Computation (by D.N.deG.A.) confirmed the expected ‘crowding’ and
revealed that practically no heat is transmitted, in these circumstances, through the 
lowest of the piston rings. Consequently only a portion of the computation field 
(indicated by broken lines in figure 14) is represented in figure 15. There the grading 
of the relaxation net is shown, accepted values of v are recorded, and isothermal 
lines are drawn for intervals of 10 units. 

It was suggested to us by Mr J. F. Alcock that in future special alloys may be used
for piston rings on account of the dominating part which these play in heat
transmission, and that therefore it would be useful to investigate the consequences of
improving their thermal conductivity. Accordingly it was decided to examine
what would be the temperature distribution if the rings were infinitely conducting
(so that v = 0 on the surface of contact of each ring and groove). Broken-line
contours in figure 15 exhibit the isothermals in this limiting case. They call for no remark
in this paper, excepting as they indicate that relaxation methods will deduce the
consequences of any assumption that may be made.

Conclusion 

32. When $\chi = 1$ everywhere in the computational field, equation (1), § 1, reduces
to the ‘plane-harmonic’ (Poisson) equation which has been treated in previous
papers. This paper examines the more general case in which $\chi$ is a specified function
of x and y.

For orthodox methods the general case is far more difficult, but negligible
modifications are required in the relaxational technique developed in this series. The ‘net




analogue’ holds as before, and it is necessary only to associate with each ‘string’ a
tension (T) equal to the value of $\chi$ at its middle point (cf. § 12), except in circumstances
discussed in § 21, where equation (38) states a new generalization of the ‘half-tension
rule’ of Part VIII, § 12. The investigation has yielded a new interpretation of
the basic finite-difference approximation, and this in turn has led to slight
modifications of the technique propounded in Part VIII for problems of ‘Class 2’ (normal
gradients specified).
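The rule that each string carries a tension equal to the mid-point value of $\chi$ can be tried on the simplest axially symmetric case, purely radial conduction through a hollow cylinder. The sketch below is not the piston problem of figure 14: it assumes that for axial symmetry $\chi$ is the radial coordinate r (equation (15) of § 5 is not reproduced here), uses a one-dimensional net, and replaces hand relaxation by repeated sweeps. All names and dimensions are illustrative.

```python
import math


def relax_radial(r_in=1.0, r_out=2.0, n=20, v_in=100.0, v_out=0.0, sweeps=2000):
    """Radial conduction between r_in and r_out on a net of n equal strings.

    The string joining nodes i and i+1 is given a tension equal to the value
    of chi (taken here as r) at its middle point; each sweep liquidates the
    residual at every internal node in turn.
    """
    a = (r_out - r_in) / n
    r = [r_in + i * a for i in range(n + 1)]
    v = [v_in + (v_out - v_in) * i / n for i in range(n + 1)]  # first guess
    for _ in range(sweeps):
        for i in range(1, n):
            t_minus = r[i] - 0.5 * a  # tension of the string to node i-1
            t_plus = r[i] + 0.5 * a   # tension of the string to node i+1
            v[i] = (t_minus * v[i - 1] + t_plus * v[i + 1]) / (t_minus + t_plus)
    return r, v


def exact_radial(r, r_in=1.0, r_out=2.0, v_in=100.0, v_out=0.0):
    """Logarithmic solution for steady radial conduction."""
    return v_out + (v_in - v_out) * math.log(r_out / r) / math.log(r_out / r_in)
```

With twenty strings the computed temperatures agree with the logarithmic solution to within a small fraction of one unit in a range of 100.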

Thus from a relaxational standpoint the ‘quasi-plane-potential’ equation differs
negligibly from the plane-harmonic, and what is new in the present paper applies 
with equal force to both equations. For that reason plane-harmonic problems were 
chosen for three of the four examples, as being more familiar. The thermal problem 
solved in figure 15 will suffice to show the power of the relaxational technique. 


References 

Relaxation methods applied to engineering problems: 

Part I: Bradfield, K. N. E. & Southwell, R. V. 1937 Proc. Roy. Soc. A, 161, 155-181.

Part III: Christopherson, D. G. & Southwell, R. V. 1938 Proc. Roy. Soc. A, 168, 317-350.

Part V: Gandy, R. W. G. & Southwell, R. V. 1940 Phil. Trans. A, 238, 453-475.

Part VI: Pellew, A. & Southwell, R. V. 1940 Proc. Roy. Soc. A, 175, 262-290.

Part VII: Shaw, F. S. & Southwell, R. V. 1941 Proc. Roy. Soc. A, 178, 1-17.

Part VIII: Southwell, R. V. & Vaisey, G. 1943 Proc. Roy. Soc. A, 182, 129-151.

Part IX: Green, J. R. & Southwell, R. V. 1944 Phil. Trans. A, 239, 367-386.

Christopherson, D. G. 1941 Proc. Instn Mech. Engrs, 146, 125-135.

Southwell, R. V. 1935 Proc. Roy. Soc. A, 151, 56-95, and 153, 41-76.

Southwell, R. V. 1942 Proc. Roy. Soc. A, 180, 367-396.



Unitary representations of the Lorentz group 
By P. A. M. Dirac, F.R.S., St John's College, Cambridge 
(Received 31 May 1944) 


Certain quantities are introduced which are like tensors in space-time with an infinite 
enumerable number of components and with an invariant positive definite quadratic form 
for their squared length. Some of the main properties of these quantities are dealt with, 
and some applications to quantum mechanics are pointed out. 


1. Introduction 

Given any group, an important mathematical problem is to get a matrix
representation of it, which means to make each element of the group correspond to a matrix
in such a way that the matrix corresponding to the product of two elements is the 
product of the matrices corresponding to the factors. The matrices may be looked 
upon as linear transformations of the co-ordinates of a vector and then each element 
of the group corresponds to a linear transformation of a field of vectors. Of special 
interest are the unitary representations, in which the linear transformations leave 
invariant a positive definite quadratic form in the co-ordinates of a vector. 

The Lorentz group is the group of linear transformations of four real variables
$\xi_0, \xi_1, \xi_2, \xi_3$, such that $\xi_0^2 - \xi_1^2 - \xi_2^2 - \xi_3^2$ is invariant. The finite representations of this
group, i.e. those whose matrices have a finite number of rows and columns, are all 
well known, and are dealt with by the usual tensor analysis and its extension spinor 
analysis. None of them is unitary. The group has also some infinite representations 
which are unitary. These do not seem to have been studied much, in spite of their 
possible importance for physical applications. 

The present paper gives a new method of attack on these representations, which 
was suggested by Fock’s quantum theory of the harmonic oscillator. It leads to a
new kind of tensor quantity in space-time, with an infinite number of components 
and a positive definite expression for its squared length. 

2. Three-dimensional theory

This section will be devoted to some preliminary work applying to the rotation 
group of three-dimensional Euclidean space. Take an ascending power series

$$a_0 + a_1\xi_1 + a_2\xi_1^2 + a_3\xi_1^3 + \dots \qquad (1)$$

in a real variable $\xi_1$ with real coefficients $a_r$. Consider these coefficients to be the
co-ordinates of a vector in a certain space of an infinite number of dimensions, and
define the squared length of the vector to be

$$\sum_0^\infty r!\,a_r^2. \qquad (2)$$

The series (2) must converge for the vector to be a finite one.




Take two more similar power series $\sum_0^\infty b_s\xi_2^s$ and $\sum_0^\infty c_t\xi_3^t$ in the real variables $\xi_2$
and $\xi_3$ and consider their coefficients to be the co-ordinates of vectors in two more
vector spaces, with squared lengths defined by the corresponding formula to (2).
Now multiply the three vector spaces together. A general vector in the product
space will be a sum of products of vectors of the three original vector spaces, and
its co-ordinates $A_{rst}$ can be represented as the coefficients in a power series

$$P = \sum_0^\infty A_{rst}\,\xi_1^r\xi_2^s\xi_3^t \qquad (3)$$

in the three variables $\xi_1, \xi_2, \xi_3$. The squared length of such a vector is

$$\sum_0^\infty r!\,s!\,t!\,A_{rst}^2, \qquad (4)$$

and two vectors with co-ordinates $A_{rst}$ and $B_{rst}$ have a scalar product

$$\sum_0^\infty r!\,s!\,t!\,A_{rst}B_{rst}. \qquad (5)$$

If the variables $\xi_1, \xi_2, \xi_3$ are subjected to a linear transformation, going over into
$\xi_1', \xi_2', \xi_3'$, say, the power series (3) will go over into a power series in $\xi_1', \xi_2', \xi_3'$,

$$P = \sum A'_{rst}\,\xi_1'^{\,r}\xi_2'^{\,s}\xi_3'^{\,t},$$

in which the coefficients $A'$ are linear functions of the previous coefficients A. Thus
each linear transformation of the $\xi$’s generates a linear transformation of the
coefficients A.

The theorem will now be proved: A linear transformation of $\xi_1, \xi_2, \xi_3$ which leaves 
$\xi_1^2+\xi_2^2+\xi_3^2$ invariant generates a linear transformation of the coefficients $A$ which 
leaves the squared length (4) invariant. Consider first the infinitesimal transformation 

$$\xi_1 = \xi_1' + \epsilon\,\xi_2', \qquad \xi_2 = \xi_2' - \epsilon\,\xi_1', \qquad \xi_3 = \xi_3', \tag{6}$$

which leaves $\xi_1^2+\xi_2^2+\xi_3^2$ invariant, $\epsilon$ being a small quantity whose square is negligible. 
Substituting into (3), one gets 

$$P = \sum A_{rst}\left(\xi_1'^{\,r}\xi_2'^{\,s} + r\epsilon\,\xi_1'^{\,r-1}\xi_2'^{\,s+1} - s\epsilon\,\xi_1'^{\,r+1}\xi_2'^{\,s-1}\right)\xi_3'^{\,t}.$$

Hence 

$$A'_{rst} = A_{rst} + (r+1)\,\epsilon A_{r+1,s-1,t} - (s+1)\,\epsilon A_{r-1,s+1,t},$$

in which $A_{rst}$ with a negative suffix is counted as zero. Thus 

$$\sum r!\,s!\,t!\,A'^2_{rst} = \sum r!\,s!\,t!\left[A_{rst}^2 + 2(r+1)\,\epsilon A_{rst}A_{r+1,s-1,t} - 2(s+1)\,\epsilon A_{r-1,s+1,t}A_{rst}\right].$$

The last two terms in the [ ] here cancel, as may be seen by substituting $r-1$ for $r$ 
in the former and $s-1$ for $s$ in the latter, and hence the squared length (4) is invariant 
for the transformation (6). Any linear transformation of $\xi_1, \xi_2, \xi_3$ which leaves 
$\xi_1^2+\xi_2^2+\xi_3^2$ invariant can be built up from the infinitesimal transformation (6) and 
similar infinitesimal transformations with $\xi_1$, $\xi_2$ and $\xi_3$ permuted, together with 
possibly a reflexion $\xi_1 = -\xi_1'$, $\xi_2 = \xi_2'$, $\xi_3 = \xi_3'$, which obviously leaves the squared 
length (4) invariant, and hence the theorem is proved. 
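
The cancellation used in this proof can be checked by exact arithmetic. The following is a minimal sketch, not part of the original argument: it takes the first-order change of the squared length (4) under the infinitesimal rotation of $\xi_1$ and $\xi_2$ and evaluates it for randomly chosen coefficients of a homogeneous polynomial. The function name `first_order_change`, the degree `u = 4` and the random integer coefficients are illustrative assumptions.

```python
from fractions import Fraction
from math import factorial
import random

def first_order_change(A):
    """Coefficient of epsilon in sum r! s! t! (A'_rst^2 - A_rst^2).

    A maps (r, s, t) to the coefficient A_rst; a missing key (including any
    key with a negative suffix) is counted as zero, as in the text.
    """
    get = lambda r, s, t: A.get((r, s, t), Fraction(0))
    total = Fraction(0)
    for (r, s, t), a in A.items():
        weight = factorial(r) * factorial(s) * factorial(t)
        total += weight * (2 * (r + 1) * a * get(r + 1, s - 1, t)
                           - 2 * (s + 1) * get(r - 1, s + 1, t) * a)
    return total

random.seed(0)
u = 4  # degree of the homogeneous test polynomial (arbitrary choice)
A = {(r, s, u - r - s): Fraction(random.randint(-9, 9))
     for r in range(u + 1) for s in range(u + 1 - r)}

print(first_order_change(A))  # the two epsilon terms cancel exactly
```

Only the terms linear in $\epsilon$ are tested here, which is all that the infinitesimal argument requires.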

The group of transformations of the $\xi$'s which leave $\xi_1^2+\xi_2^2+\xi_3^2$ invariant is the 
rotation group in three-dimensional Euclidean space, so the transformations of the 
coefficients $A$ provide a representation of this rotation group. One may restrict the 
function $P$ to be homogeneous, of degree $u$ say, and then the representation is a 
finite one. The coefficients $A$ then form the components of a symmetrical tensor 
of rank $u$, the connexion with the usual tensor notation being effected by taking 
$A_{rst}$ to be $u!/r!\,s!\,t!$ times the usual tensor component with the suffix 1 occurring 
$r$ times, 2 occurring $s$ times and 3 occurring $t$ times, as may be seen from the 
invariance of expression (3) with $(\xi_1, \xi_2, \xi_3)$ transforming like a vector. 

One can make a straightforward generalization of the foregoing theory by 
introducing other triplets of variables, $\eta_1, \eta_2, \eta_3$ and $\zeta_1, \zeta_2, \zeta_3$ say, which transform 
together with $\xi_1, \xi_2, \xi_3$, and setting up a power series in all the variables. The 
transformations of the coefficients of these more general power series will provide further 
representations of the three-dimensional rotation group. If such a more general 
power series is restricted to be homogeneous, its coefficients will form the components 
of an unsymmetrical tensor. 


3. Four-dimensional theory 

Take a descending power series 

$$k_0/\xi_0 + k_1/\xi_0^2 + k_2/\xi_0^3 + k_3/\xi_0^4 + \ldots \tag{7}$$

in a real variable $\xi_0$ with real coefficients $k_n$. Consider these coefficients to be the 
co-ordinates of a vector in a certain space of an infinite number of dimensions, and 
define the squared length of the vector to be 

$$\sum_0^\infty n!^{-1}\,k_n^2. \tag{8}$$

Multiply this vector space into the vector space of the preceding section. A general 
vector in the product space will have co-ordinates $A_{nrst}$ which can be represented 
as the coefficients in a power series 

$$Q = \sum_0^\infty A_{nrst}\,\xi_0^{-n-1}\,\xi_1^r\,\xi_2^s\,\xi_3^t \tag{9}$$

in the four variables $\xi_0, \xi_1, \xi_2, \xi_3$. The squared length of such a vector is 

$$\sum_0^\infty n!^{-1}\,r!\,s!\,t!\,A_{nrst}^2, \tag{10}$$

and two vectors with co-ordinates $A_{nrst}$ and $B_{nrst}$ have a scalar product 

$$\sum_0^\infty n!^{-1}\,r!\,s!\,t!\,A_{nrst}B_{nrst}. \tag{11}$$

The series (7) may be extended backwards to include some terms with 
non-negative powers of $\xi_0$, so that coefficients $k_n$ occur with negative $n$-values, leading 
to coefficients $A_{nrst}$ with negative $n$-values. Since $n!$ is infinite for $n$ negative, these 
new coefficients do not contribute to the squared length of a vector or the scalar 
product of two vectors. Thus the terms with non-negative powers of $\xi_0$ should be 
counted as corresponding to the vector zero, and whether they are present in an 
expansion or not does not matter. 




Now apply a Lorentz transformation to the $\xi$'s, 

$$\xi_\mu = a_\mu^{\ \nu}\,\xi'_\nu, \tag{12}$$

the $a$'s satisfying certain conditions so that $\xi_0^2-\xi_1^2-\xi_2^2-\xi_3^2$ is invariant. This makes 
$\xi_1^r\xi_2^s\xi_3^t$ go over into a finite polynomial in $\xi_0', \xi_1', \xi_2', \xi_3'$, and $\xi_0^{-n-1}$ go over into 

$$(a_0^{\ 0}\xi_0' + a_0^{\ 1}\xi_1' + a_0^{\ 2}\xi_2' + a_0^{\ 3}\xi_3')^{-n-1}, \tag{13}$$

which may be expanded in ascending powers of $\xi_1', \xi_2', \xi_3'$ and descending powers 
of $\xi_0'$. (The question of the convergence of this expansion is left to the next section, 
so as not to break the main argument here.) The power series (9) then goes over 
into a series 

$$Q = \sum A'_{nrst}\,\xi_0'^{\,-n-1}\,\xi_1'^{\,r}\,\xi_2'^{\,s}\,\xi_3'^{\,t} \tag{14}$$

in ascending powers of $\xi_1', \xi_2', \xi_3'$ and descending powers of $\xi_0'$, with coefficients $A'$ 
which are linear functions of the previous coefficients $A$. There may be terms in 
(14) with non-negative powers of $\xi_0'$, but on account of what was said above these 
can be discarded. If such terms are present in the original series (9), they will not 
affect any of the coefficients with non-negative $n$-values in (14). 

The Lorentz transformation thus generates a linear transformation of the 
coefficients $A_{nrst}$ with non-negative $n$-values. The theorem will now be proved: The 
transformation of the coefficients $A$ leaves the squared length (10) invariant. 

Consider first the infinitesimal Lorentz transformation 

$$\xi_0 = \xi_0' + \epsilon\,\xi_1', \qquad \xi_1 = \xi_1' + \epsilon\,\xi_0', \qquad \xi_2 = \xi_2', \qquad \xi_3 = \xi_3'. \tag{15}$$

Substituting into (9), one gets 

$$Q = \sum A_{nrst}\left[\xi_0'^{\,-n-1}\xi_1'^{\,r} - (n+1)\,\epsilon\,\xi_0'^{\,-n-2}\xi_1'^{\,r+1} + r\epsilon\,\xi_0'^{\,-n}\xi_1'^{\,r-1}\right]\xi_2'^{\,s}\xi_3'^{\,t}.$$

Hence 

$$A'_{nrst} = A_{nrst} - n\epsilon A_{n-1,r-1,s,t} + (r+1)\,\epsilon A_{n+1,r+1,s,t},$$

in which $A_{nrst}$ with a negative value for $r$, $s$ or $t$ is counted as zero. Thus 

$$\sum n!^{-1}\,r!\,s!\,t!\,A'^2_{nrst} = \sum n!^{-1}\,r!\,s!\,t!\left[A_{nrst}^2 - 2n\epsilon A_{nrst}A_{n-1,r-1,s,t} + 2(r+1)\,\epsilon A_{n+1,r+1,s,t}A_{nrst}\right].$$

The last two terms in the [ ] here cancel, as may be seen by substituting $n+1$ for 
$n$ in the former and $r-1$ for $r$ in the latter, and hence the squared length (10) is 
invariant for the transformation (15). Any Lorentz transformation can be built up 
from infinitesimal transformations like (15) and three-dimensional rotations like 
those considered in the preceding section, together with possibly a reflexion, which 
obviously leaves the squared length (10) invariant, and hence the theorem is proved. 
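
As a check on the cancellation in the four-dimensional case, the first-order change of the squared length (10) may be evaluated exactly. The sketch below is illustrative only. It assumes a set of coefficients with finitely many non-zero members (a truncation made for the test, not a property of a general expansor), and the name `first_order_change_4d` is hypothetical.

```python
from fractions import Fraction
from math import factorial
import random

def first_order_change_4d(A):
    """Coefficient of epsilon in the change of sum n!^-1 r! s! t! A_nrst^2.

    A maps (n, r, s, t) to A_nrst for the infinitesimal transformation mixing
    xi_0 and xi_1; any coefficient not present in A is counted as zero.
    """
    get = lambda n, r, s, t: A.get((n, r, s, t), Fraction(0))
    total = Fraction(0)
    for (n, r, s, t), a in A.items():
        weight = Fraction(factorial(r) * factorial(s) * factorial(t), factorial(n))
        total += weight * (-2 * n * a * get(n - 1, r - 1, s, t)
                           + 2 * (r + 1) * get(n + 1, r + 1, s, t) * a)
    return total

random.seed(1)
A4 = {(n, r, s, t): Fraction(random.randint(-9, 9))
      for n in range(4) for r in range(4) for s in range(2) for t in range(2)}

print(first_order_change_4d(A4))  # the two epsilon terms cancel exactly
```

The weight $n!^{-1}$ on the descending index is what makes the two terms pair off; with $n!$ in its place the test fails.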

The transformations of the coefficients $A$ thus provide a unitary representation 
of the Lorentz group. The coefficients themselves form the components of a new kind 
of tensor quantity in space-time. I propose for it the name expansor, because of its 
connexion with binomial expansions. One may restrict the function (9) to be 
homogeneous and one then gets a simpler kind of expansor, which may be called a 
homogeneous expansor. The analogy with the three-dimensional case suggests that 
one should look upon a homogeneous expansor as a symmetrical tensor in 
space-time with the suffix 0 occurring in its components a negative number of times. 

The foregoing theory can be generalized, like the three-dimensional theory, by 
the introduction of other quadruplets of variables, $\eta_0, \eta_1, \eta_2, \eta_3$ and $\zeta_0, \zeta_1, \zeta_2, \zeta_3$ say, 
which transform together with $\xi_0, \xi_1, \xi_2, \xi_3$. One can then set up a power series in 
all the variables, ascending in those variables with suffixes 1, 2 and 3, and descending 
in those variables with suffix 0. The transformations of the coefficients of these more 
general power series will provide further unitary representations of the Lorentz 
group, and the coefficients themselves will form the components of more general 
expansors. 

There is another generalization which may readily be made in the theory, namely, 
to take the values of the index $n$ in (9) to be not integers, but any set of real numbers 
$n_0, n_0+1, n_0+2, \ldots$ extending to infinity. In the formula (10) for the squared length 
$n!$ is then to be interpreted as $\Gamma(n+1)$. The terms with negative $n$-values can no 
longer be discarded. The expression for the squared length is still positive definite 
if the minimum value of $n$ is greater than $-1$, which is the case if the function (9) 
is homogeneous and its degree is negative. The resulting representation is then still 
unitary. If, however, the function (9) is homogeneous and its degree is positive, 
there will be a finite number of negative terms in the expression for the squared 
length. The resulting kind of representation may be called nearly unitary. 
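
The statement about the sign of the terms can be illustrated numerically. The following sketch lists the signs of the weights $1/\Gamma(n+1)$ for $n = n_0, n_0+1, \ldots$; the starting values $-0{\cdot}5$ and $-3{\cdot}5$ are arbitrary illustrative choices and the function name `weight_signs` is hypothetical.

```python
from math import gamma

def weight_signs(n0, count):
    """Signs of 1/Gamma(n+1) for n = n0, n0+1, ..., n0+count-1 (n0 not an integer)."""
    return [1 if gamma(n0 + k + 1) > 0 else -1 for k in range(count)]

print(weight_signs(-0.5, 6))  # minimum n above -1: every weight is positive
print(weight_signs(-3.5, 6))  # minimum n below -1: a finite number of negative weights
```

Once $n$ exceeds $-1$ all later weights are positive, so only finitely many terms of the squared length can be negative.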


4. Some theorems on convergence 

If the series (2) is convergent, then (1) is convergent for all values of $\xi$. Similarly, 
if (4) is convergent, then (3) is convergent for all values of $\xi_1, \xi_2, \xi_3$. On the other 
hand, if (8) is convergent, (7) need not be convergent for any value of $\xi_0$, in which 
case, of course, it does not define a function of $\xi_0$. Similarly, if (10) is convergent, 
(9) need not be convergent for any values for the $\xi$'s. Thus, corresponding to a general 
expansor $A_{nrst}$, there need not exist any function $Q$ of the $\xi$'s. However, it will now 
be proved that if the series (9) is homogeneous and (10) is convergent, then (9) is 
absolutely convergent for all values of the $\xi$'s satisfying 

$$\xi_0^2-\xi_1^2-\xi_2^2-\xi_3^2 > 0. \tag{16}$$

If the series (9) is homogeneous of degree $u-1$, it may be written 

$$Q = \sum_n \xi_0^{-n-1}\sum_S A_{nrst}\,\xi_1^r\,\xi_2^s\,\xi_3^t, \tag{17}$$

where $\sum_S$ means a sum over all values of $r$, $s$ and $t$ satisfying 

$$r+s+t = n+u. \tag{18}$$

With this notation the series (10) may be written 

$$\sum_n n!^{-1}\sum_S r!\,s!\,t!\,A_{nrst}^2. \tag{19}$$




Now apply Cauchy’s inequality 

$$(x_1y_1+x_2y_2+x_3y_3+\ldots)^2 \le (x_1^2+x_2^2+x_3^2+\ldots)(y_1^2+y_2^2+y_3^2+\ldots) \tag{20}$$

in the following way. Take $(r+s+t)!/r!\,s!\,t!$ of the $x$'s in (20) to be each equal to 
$\xi_1^r\xi_2^s\xi_3^t$ and the corresponding $y$'s to be each equal to $A_{nrst}\,r!\,s!\,t!/(r+s+t)!$, and do 
this for all $r$, $s$, $t$ subject to (18) for a fixed value of $n$. Then (20) becomes 

$$\Big[\sum_S A_{nrst}\,\xi_1^r\xi_2^s\xi_3^t\Big]^2 \le (\xi_1^2+\xi_2^2+\xi_3^2)^{n+u}\,(n+u)!^{-1}\sum_S r!\,s!\,t!\,A_{nrst}^2. \tag{21}$$

If the sum with respect to $n$ in (19) converges, there must be some number $\kappa$ such that 

$$n!^{-1}\sum_S r!\,s!\,t!\,A_{nrst}^2 < \kappa$$

for all $n$. Then from (21) 

$$\Big|\,\xi_0^{-n-1}\sum_S A_{nrst}\,\xi_1^r\xi_2^s\xi_3^t\Big| < \left[\frac{\kappa\,n!}{(n+u)!}\right]^{1/2}\frac{(\xi_1^2+\xi_2^2+\xi_3^2)^{\frac12(n+u)}}{|\xi_0|^{n+1}},$$

which shows that (17) is convergent when (16) is satisfied. One may take $\xi_0, \xi_1, \xi_2, \xi_3$ 
and the $A$'s to be all positive without disturbing the argument, so (17), considered 
as a quadruple series in $n$, $r$, $s$, $t$, is absolutely convergent. 
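
The inequality (21) can be tested numerically for a single value of $n$. The sketch below is an illustration under stated assumptions: the value `N = 5` stands for $n+u$, and the coefficients and the point $(\xi_1, \xi_2, \xi_3)$ are arbitrary. The name `check_21` is hypothetical.

```python
from math import factorial
import random

def check_21(N, xi, A):
    """Return (lhs, rhs) of inequality (21) for coefficients with r + s + t = N."""
    poly, weighted = 0.0, 0.0
    for (r, s, t), a in A.items():
        poly += a * xi[0]**r * xi[1]**s * xi[2]**t
        weighted += factorial(r) * factorial(s) * factorial(t) * a * a
    rho2 = sum(x * x for x in xi)          # xi_1^2 + xi_2^2 + xi_3^2
    return poly**2, rho2**N / factorial(N) * weighted

random.seed(2)
N = 5                                      # plays the part of n + u
A = {(r, s, N - r - s): random.uniform(-1, 1)
     for r in range(N + 1) for s in range(N + 1 - r)}
lhs, rhs = check_21(N, (0.3, -1.2, 0.7), A)
print(lhs <= rhs)
```

The multinomial theorem is what turns the sum of the $x^2$ in (20) into the power of $\xi_1^2+\xi_2^2+\xi_3^2$ appearing on the right.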

Thus for any homogeneous expansor of finite length, there exists a function $Q$ 
given by (9) defined within the light-cone (16). In transforming this function with 
a Lorentz transformation (12), it is legitimate to use the expansion of (13) in 
ascending powers of $\xi_1', \xi_2', \xi_3'$ and descending powers of $\xi_0'$, for the following reason. 
Suppose $\xi_\mu'$ is within the light-cone and take for definiteness $\xi_0' > 0$. Then 

$$a_0^{\ 0}\xi_0' + a_0^{\ 1}\xi_1' + a_0^{\ 2}\xi_2' + a_0^{\ 3}\xi_3' > 0. \tag{22}$$

One can change the sign of any of the co-ordinates $\xi_1', \xi_2', \xi_3'$, leaving $\xi_0'$ unchanged, 
and $\xi_\mu'$ will still lie within the light-cone with $\xi_0' > 0$, so (22) must still be satisfied. 
Hence 

$$a_0^{\ 0}\xi_0' - |a_0^{\ 1}\xi_1'| - |a_0^{\ 2}\xi_2'| - |a_0^{\ 3}\xi_3'| > 0,$$

which shows that the expansion of (13) in the required manner is absolutely 
convergent. The use of this expansion in the preceding section is thus justified for the 
case when (9) is homogeneous, the new coefficients $A'$ being determined by the 
transformed function $Q$ within the light-cone. The justification for (9) not 
homogeneous then follows since terms of different degree do not interfere. It should be 
noted that all the foregoing arguments are valid also when the values of $n$ are not 
integers. 

There are some expansors which are invariant under all Lorentz transformations, 
namely those whose components are the coefficients of $(\xi_0^2-\xi_1^2-\xi_2^2-\xi_3^2)^{-l}$ expanded 
in ascending powers of $\xi_1, \xi_2, \xi_3$ and descending powers of $\xi_0$. For such an expansion 
to be possible, $l$ must not be a negative integer or zero, but it can be any other real 
number. Again, there are expansors which transform like ordinary tensors under 
Lorentz transformations, namely those whose components are the coefficients in 
the expansion of 

$$f\,(\xi_0^2-\xi_1^2-\xi_2^2-\xi_3^2)^{-l}, \tag{23}$$

where $f$ is any homogeneous integral polynomial in $\xi_0, \xi_1, \xi_2, \xi_3$ and $l$ is restricted 
as before. (23) transforms like the polynomial $f$, and thus like a tensor of order 
equal to the degree of $f$. One would expect all these expansors to be of infinite 
length, as otherwise one could set up a positive definite form for the squared length 
of a tensor in space-time. A formal proof that they are is as follows. 

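Suppose the $f$ in (23) is of degree $u$ and express it as 

$$f = g_u + g_{u-1}\,\xi_0 + g_{u-2}\,\xi_0^2 + \ldots,$$

where each of the $g$'s is a polynomial in $\xi_1, \xi_2, \xi_3$ only, of degree 
indicated by the suffix. With $\xi^2 = \xi_1^2+\xi_2^2+\xi_3^2$, the successive terms here contribute 
to (23) the amounts 

$$g_u\,(\xi_0^2-\xi^2)^{-l} = \sum_{m=0}^\infty \frac{l(l+1)(l+2)\ldots(l+m-1)}{m!}\,\frac{g_u\,\xi^{2m}}{\xi_0^{2(m+l)}},$$

$$g_{u-1}\,\xi_0\,(\xi_0^2-\xi^2)^{-l} = \sum_{m=0}^\infty \frac{l(l+1)(l+2)\ldots(l+m-1)}{m!}\,\frac{g_{u-1}\,\xi^{2m}}{\xi_0^{2(m+l)-1}},$$

$$g_{u-2}\,\xi_0^2\,(\xi_0^2-\xi^2)^{-l} = \sum_{m=-1}^\infty \frac{l(l+1)(l+2)\ldots(l+m)}{(m+1)!}\,\frac{g_{u-2}\,\xi^{2m+2}}{\xi_0^{2(m+l)}},$$

$$g_{u-3}\,\xi_0^3\,(\xi_0^2-\xi^2)^{-l} = \sum_{m=-1}^\infty \frac{l(l+1)(l+2)\ldots(l+m)}{(m+1)!}\,\frac{g_{u-3}\,\xi^{2m+2}}{\xi_0^{2(m+l)-1}},$$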

and so on. These expansions show that, for large values of $m$, the terms arising from 
$g_{u-2}, g_{u-4}, \ldots$ are of smaller order than the corresponding terms arising from $g_u$, 
and the terms arising from $g_{u-3}, g_{u-5}, \ldots$ are of smaller order than the corresponding 
terms arising from $g_{u-1}$, so that in testing for the convergence of the series which 
gives the squared length, only the $g_u$ and $g_{u-1}$ terms need be taken into account. 
(It will be found that the convergence conditions are not sufficiently critical to be 
affected by this neglect.) Now express $g_u$ and $g_{u-1}$ as 

$$g_u = S_u + \xi^2 S_{u-2} + \xi^4 S_{u-4} + \ldots,$$

$$g_{u-1} = S_{u-1} + \xi^2 S_{u-3} + \xi^4 S_{u-5} + \ldots,$$


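where the $S$'s are solid harmonic functions of $\xi_1, \xi_2, \xi_3$, of degrees indicated by the 
suffixes. Each of them gives a contribution to (23) of the form 

$$S_{u-2r}\,\xi^{2r}\,(\xi_0^2-\xi^2)^{-l} = S_{u-2r}\sum_{m=0}^\infty \frac{(m+l-1)!}{m!\,(l-1)!}\,\frac{\xi^{2(m+r)}}{\xi_0^{2(m+l)}} \tag{24}$$

or 

$$S_{u-2r-1}\,\xi^{2r}\,\xi_0\,(\xi_0^2-\xi^2)^{-l} = S_{u-2r-1}\sum_{m=0}^\infty \frac{(m+l-1)!}{m!\,(l-1)!}\,\frac{\xi^{2(m+r)}}{\xi_0^{2(m+l)-1}}. \tag{25}$$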


Using the result (41) of the appendix, one finds for the squared lengths of the 
expansors whose components are the coefficients of (24) and (25), series of the form 

$$c\sum_m \frac{(m+l-1)!^2}{m!^2\,(l-1)!^2}\;\frac{4^{m+r}\,(m+r)!\,(m+u-r+\tfrac12)!}{(2m+2l-1)!}$$

and 

$$c\sum_m \frac{(m+l-1)!^2}{m!^2\,(l-1)!^2}\;\frac{4^{m+r}\,(m+r)!\,(m+u-r-\tfrac12)!}{(2m+2l-2)!}$$

respectively. In both these series the ratio of the $(m+1)$th term to the $m$th is $1+u/m$ 
for large $m$, and since $u$ is necessarily positive or zero, the series diverge and the 
expansors are of infinite length. From the result (40) of the appendix, expansors 
associated with solid harmonics of different degrees are orthogonal to one another, 
and hence the total expansor with the coefficients of (23) as components must also 
be of infinite length. 
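
The limiting ratio $1+u/m$ can be confirmed by writing out the ratio of successive terms of the first series, obtained from the factorials term by term. The sketch below assumes an integral $l$ and arbitrary illustrative values of $r$ and $u$; the function name `term_ratio` is hypothetical.

```python
def term_ratio(m, l, r, u):
    """Ratio T(m+1)/T(m) for the series arising from (24).

    T(m) = (m+l-1)!^2 4^(m+r) (m+r)! (m+u-r+1/2)! / (m!^2 (l-1)!^2 (2m+2l-1)!).
    """
    return ((m + l)**2 * 4 * (m + r + 1) * (m + u - r + 1.5)
            / ((m + 1)**2 * (2 * m + 2 * l + 1) * (2 * m + 2 * l)))

m = 10**6
for u in (0, 2, 5):
    # m * (ratio - 1) should approach u as m grows
    print(u, m * (term_ratio(m, l=3, r=0, u=u) - 1))
```

Since the ratio is not less than $1 - 1/m$ for large $m$, the terms do not fall off fast enough for convergence, in agreement with the conclusion above.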


5. A transformation of variables 

Up to the present, expansors have always been considered in connexion with 
certain $\xi$-functions, the components of the expansor being the coefficients of the 
$\xi$-function. In the case of integral $n$-values, one can make a transformation of 
variables of the kind which is familiar in quantum mechanics, and get the expansors 
connected with some other functions, which serves to clarify some of their properties. 
Introduce the four operators $x_\mu$ defined by 

$$2^{1/2}x_0 = \xi_0 - \partial/\partial\xi_0, \qquad 2^{1/2}x_r = \xi_r + \partial/\partial\xi_r \quad (r = 1, 2, 3). \tag{26}$$

Under Lorentz transformations they transform like the components of a 4-vector. 
The operators $\partial/\partial x_\mu$ may be taken to be 

$$2^{1/2}\,\partial/\partial x_0 = \xi_0 + \partial/\partial\xi_0, \qquad 2^{1/2}\,\partial/\partial x_r = -\xi_r + \partial/\partial\xi_r, \tag{27}$$

as this gives the correct commutation relations between all the operators. One can 
now represent $\xi$-functions with integral $n$-values by functions of the $x$'s according 
to the following scheme. 

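Take as starting point the $\xi$-function $\xi_0^{-1}$. It vanishes when operated on by 
$\partial/\partial\xi_1$, $\partial/\partial\xi_2$ or $\partial/\partial\xi_3$, and also effectively vanishes when multiplied by $\xi_0$, since the 
result of the multiplication can be discarded. It must therefore be represented by 
a function of the $x$'s which vanishes when operated on by $x_r + \partial/\partial x_r$ with $r = 1$, 2 or 3, 
or by $x_0 + \partial/\partial x_0$, and thus by a multiple of $e^{-\frac12(x_0^2+x^2)}$, where $x^2 = x_1^2+x_2^2+x_3^2$. Take 

$$\xi_0^{-1} \equiv \pi^{-1}\,e^{-\frac12(x_0^2+x^2)}, \tag{28}$$

where the sign $\equiv$ means 'represented by'. Multiply the left-hand side of (28) by 
the operator $(-\partial/\partial\xi_0)^n\,\xi_1^r\xi_2^s\xi_3^t$ and the right-hand side by the operator equal to it 
according to (26) and (27). The result is, after dividing through by $n!$, 

$$\xi_0^{-n-1}\xi_1^r\xi_2^s\xi_3^t \equiv \pi^{-1}\,n!^{-1}\,2^{-\frac12(n+r+s+t)}\Big(x_0-\frac{\partial}{\partial x_0}\Big)^n\Big(x_1-\frac{\partial}{\partial x_1}\Big)^r\Big(x_2-\frac{\partial}{\partial x_2}\Big)^s\Big(x_3-\frac{\partial}{\partial x_3}\Big)^t e^{-\frac12(x_0^2+x^2)} = F_{nrst}(x_0, x_1, x_2, x_3), \tag{29}$$

say. This gives the function $F$ of the $x$'s which represents $\xi_0^{-n-1}\xi_1^r\xi_2^s\xi_3^t$. A general 
$\xi$-function is now represented thus, 

$$\sum A_{nrst}\,\xi_0^{-n-1}\xi_1^r\xi_2^s\xi_3^t \equiv \sum A_{nrst}\,F_{nrst}(x_0, x_1, x_2, x_3). \tag{30}$$

In this way a general expansor $A_{nrst}$ gets connected with a function of the $x$'s. 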



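The chief interest of this connexion is that the law for the scalar product of two 
expansors becomes very simple when expressed in terms of the functions of the $x$'s. 
The scalar product is the integral of the product of the two functions of the $x$'s over all 
$x_0$, $x_1$, $x_2$ and $x_3$. To prove this, first evaluate the integral 

$$\int_{-\infty}^{\infty}\Big(z-\frac{d}{dz}\Big)^m e^{-\frac12 z^2}\cdot\Big(z-\frac{d}{dz}\Big)^{m'} e^{-\frac12 z^2}\,dz, \tag{31}$$

where the dot has the meaning that operators to the left of it do not operate on 
functions to the right of it. For $m > 0$, (31) goes over by partial integration into 

$$\int_{-\infty}^{\infty}\Big(z-\frac{d}{dz}\Big)^{m-1} e^{-\frac12 z^2}\cdot\Big(z+\frac{d}{dz}\Big)\Big(z-\frac{d}{dz}\Big)^{m'} e^{-\frac12 z^2}\,dz.$$

Applying this procedure $m$ times, one gets zero if $m' < m$ and $2^m\,m!\,\pi^{1/2}$ if $m' = m$. 
Since there is symmetry between $m$ and $m'$, the result is 

$$\int_{-\infty}^{\infty}\Big(z-\frac{d}{dz}\Big)^m e^{-\frac12 z^2}\cdot\Big(z-\frac{d}{dz}\Big)^{m'} e^{-\frac12 z^2}\,dz = 2^m\,m!\,\pi^{1/2}\,\delta_{mm'}.$$

Now substitute for $z$ each of the variables $x_0, x_1, x_2, x_3$ in turn, with $m$ equal to $n, r, s, t$ 
and $m'$ equal to $n', r', s', t'$ respectively, and multiply the four equations so obtained. 
The result, after dividing through by $\pi^2\,n!^2\,2^{n+r+s+t}$, is 

$$\iiiint F_{nrst}\,F_{n'r's't'}\,dx_0\,dx_1\,dx_2\,dx_3 = n!^{-1}\,r!\,s!\,t!\,\delta_{nn'}\delta_{rr'}\delta_{ss'}\delta_{tt'}.$$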

This proves the theorem for the case when the expansors each have only one non- 
vanishing component, which is sufficient to prove it generally. 
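
The one-dimensional integral on which this proof rests can be evaluated exactly by machine. In the sketch below the operator $(z - d/dz)$ acting on $p(z)\,e^{-z^2/2}$ is represented by its action $p \to 2zp - p'$ on the polynomial factor, and the integrals are expressed in units of $\pi^{1/2}$ by means of the Gaussian moments. The function names are hypothetical and the code is an illustration, not part of the original argument.

```python
from fractions import Fraction
from math import factorial

def raise_op(p):
    """(z - d/dz) applied to p(z) exp(-z^2/2): returns the polynomial 2 z p - p'.

    Polynomials are coefficient lists, p[k] multiplying z^k.
    """
    out = [Fraction(0)] * (len(p) + 1)
    for k, c in enumerate(p):
        out[k + 1] += 2 * c          # contribution of 2 z p
        if k >= 1:
            out[k - 1] -= k * c      # contribution of -p'
    return out

def gauss_moment(k):
    """Integral of z^k exp(-z^2) over the real line, in units of sqrt(pi)."""
    if k % 2:
        return Fraction(0)
    value = Fraction(1)
    for j in range(1, k, 2):         # builds (k-1)!! / 2^(k/2)
        value *= Fraction(j, 2)
    return value

def integral_31(m, m_prime):
    """The integral (31) divided by sqrt(pi)."""
    p, q = [Fraction(1)], [Fraction(1)]
    for _ in range(m):
        p = raise_op(p)
    for _ in range(m_prime):
        q = raise_op(q)
    return sum(a * b * gauss_moment(i + j)
               for i, a in enumerate(p) for j, b in enumerate(q))

print([[int(integral_31(m, mp)) for mp in range(4)] for m in range(4)])
```

The diagonal entries reproduce $2^m\,m!$ and the off-diagonal entries vanish.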

One can obviously get a unitary representation of the Lorentz group by 
considering the transformations of a set of vectors in a space of an infinite number of 
dimensions, where each vector corresponds to a function of four Lorentz variables 
$x_0, x_1, x_2, x_3$, and where the squared length of a vector is defined as the integral of the 
square of the function over all $x_0, x_1, x_2, x_3$. The above work shows how this obvious 
unitary representation is connected with the expansor theory. 


6. Applications to quantum mechanics 

The four $x_\mu$ of the preceding section may be looked upon as the co-ordinates 
of a four-dimensional harmonic oscillator, the four operators $i\,\partial/\partial x_\mu$ being the 
conjugate momenta $p^\mu$, and the energy of the oscillator may be taken to be the Lorentz 
invariant 

$$\tfrac12\left[x_1^2+x_2^2+x_3^2-x_0^2+(p^1)^2+(p^2)^2+(p^3)^2-(p^0)^2\right]. \tag{32}$$

The 1, 2, 3 components of the oscillator thus have positive energies and the 0 
component negative energy. 




The state of the oscillator for which the 0, 1, 2, 3 components are in the $n$th, $r$th, 
$s$th, $t$th quantum states respectively is then represented by the function $F_{nrst}$ defined 
by (29), with a suitable normalizing factor. This representative may be transformed 
to the $\xi$-representation and becomes $\xi_0^{-n-1}\xi_1^r\xi_2^s\xi_3^t$. Thus a state of the oscillator for 
which each of its components is in a quantum state corresponds to an expansor with 
one non-vanishing component. A general state of the oscillator therefore corresponds 
to a general expansor with integral $n$-values. A stationary state of the oscillator 
corresponds to a homogeneous expansor, the degree of the expansor giving the 
energy of the state with neglect of zero-point energy. 

Four-dimensional harmonic oscillators of the above type occur in the theory of 
the electromagnetic field. Each Fourier component of the field, specified by a 
particular frequency and a particular direction of motion of the waves, provides one 
such oscillator, its four components coming from the four electromagnetic potentials. 
Thus a state of the electromagnetic field in quantum mechanics is described by a 
number of expansors, one for each Fourier component. By using the electromagnetic 
equation which gives the value of the divergence of the potentials, one can eliminate 
in a non-relativistic way the 0 component and one other component of each of the 
four-dimensional oscillators, so that only two-dimensional oscillators are left. This 
circumstance has made it possible for people to develop quantum electrodynamics 
without using expansors. 

Another possible application of expansors is to the spins of particles. The wave 
function describing a particle may be a function of the four co-ordinates of the 
particle in space-time and also of the four variables $\xi_\mu$ whose coefficients are the 
components of an expansor. As a simple example of relativistic wave equations for 
such a particle in the absence of external forces, one may consider 

The first of these is the usual equation for the motion of the particle as a whole. The 
second shows that at each point in space-time the wave function $\psi$ is homogeneous 
in the $\xi$'s of degree $-1$. The third shows that the state for which the 
momentum-energy four-vector of the particle has the value $p_\mu$ is represented by the wave 
function 

$$\psi = \frac{m\,e^{-i[p_0x_0-(\mathbf{p}\mathbf{x})]/\hbar}}{\xi_0p_0-(\boldsymbol{\xi}\mathbf{p})}$$

in three-dimensional vector notation. This may be expanded as 

$$\psi = m\,e^{-i[p_0x_0-(\mathbf{p}\mathbf{x})]/\hbar}\sum_{n=0}^\infty \frac{(\boldsymbol{\xi}\mathbf{p})^n}{(\xi_0p_0)^{n+1}}. \tag{34}$$

For the state for which the particle is at rest, $p_1 = p_2 = p_3 = 0$, $p_0 = m$, and $\psi$ 
reduces to 

$$\psi = \xi_0^{-1}\,e^{-imx_0/\hbar}.$$




This $\psi$ is spherically symmetrical, showing that when the particle is at rest it has 
no spin. But when the particle is moving, it is represented by the general $\psi$ (34) 
and has a finite probability of a non-zero spin. In fact, taking for simplicity 
$p_2 = p_3 = 0$, the particle has a probability $(m\,p_1^n/p_0^{n+1})^2$ of being in a state of spin 
corresponding to the transformations of $\xi_1^n$ under three-dimensional rotations. 
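
It may be verified that these probabilities add up to unity. The sketch below assumes the relation $p_0^2 - p_1^2 = m^2$ between the energy, momentum and mass (in units with the velocity of light equal to one), which is not stated explicitly in this passage; the numerical values and the function name `spin_probabilities` are illustrative.

```python
from math import sqrt

def spin_probabilities(m, p1, n_max):
    """Probabilities (m p1^n / p0^(n+1))^2 for n = 0..n_max, with p2 = p3 = 0.

    Assumes the mass-shell relation p0^2 - p1^2 = m^2 (units with c = 1).
    """
    p0 = sqrt(m * m + p1 * p1)
    return [(m * p1**n / p0**(n + 1))**2 for n in range(n_max + 1)]

probs = spin_probabilities(m=1.0, p1=0.75, n_max=200)
print(probs[0], sum(probs))

at_rest = spin_probabilities(m=1.0, p1=0.0, n_max=5)
print(at_rest)   # all the probability is in n = 0 when the particle is at rest
```

The sum is a geometric series with ratio $p_1^2/p_0^2$, so that it converges for any finite momentum.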

This example shows there is a possibility of a particle having no spin when at rest 
but acquiring a spin when moving, a state of affairs which was not allowed by previous 
theory. It is desirable that the new spin possibilities opened up by the present theory 
should be investigated to see whether they could in some cases give an improved 
description of Nature. The present theory of expansors applies, of course, only to 
integral spins, but probably it will be possible to set up a corresponding theory of 
two-valued representations of the Lorentz group, which will apply to half odd 
integral spins. 


Appendix 

The rules (5), (11) for forming scalar products are not always convenient for direct 
use. There are various ways of transforming them and making them more suitable 
for practical application. One such way has been given (Dirac 1942, equation 3·22) 
for the case of a single $\xi$ with ascending power series. Another way, applicable to 
the case of homogeneous functions of $\xi_1, \xi_2, \xi_3$, is provided by the following. 

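By partial integration with respect to $\xi'$, one gets, for $m > 0$, 

$$\iint_{-\infty}^{\infty}\xi^m\xi'^{\,n}e^{i\xi\xi'}\,d\xi\,d\xi' = -i\int_{-\infty}^{\infty}\Big[\xi^{m-1}\xi'^{\,n}e^{i\xi\xi'}\Big]_{\xi'=-\infty}^{\infty}d\xi + in\iint_{-\infty}^{\infty}\xi^{m-1}\xi'^{\,n-1}e^{i\xi\xi'}\,d\xi\,d\xi'.$$

If the integrals are made precise in the sense of Cesàro, which means neglecting 
oscillating terms like $\xi'^{\,n}e^{i\xi\xi'}$ for $\xi'$ infinite, this gives 

$$\iint_{-\infty}^{\infty}\xi^m\xi'^{\,n}e^{i\xi\xi'}\,d\xi\,d\xi' = in\iint_{-\infty}^{\infty}\xi^{m-1}\xi'^{\,n-1}e^{i\xi\xi'}\,d\xi\,d\xi'.$$

Taking $m \ge n$ and applying the partial integration process $n$ times, one gets 

$$\iint_{-\infty}^{\infty}\xi^m\xi'^{\,n}e^{i\xi\xi'}\,d\xi\,d\xi' = i^n\,n!\iint_{-\infty}^{\infty}\xi^{m-n}e^{i\xi\xi'}\,d\xi\,d\xi' = 2\pi\,i^n\,n!\int_{-\infty}^{\infty}\xi^{m-n}\,\delta(\xi)\,d\xi = 2\pi\,i^n\,n!\,\delta_{mn}. \tag{35}$$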


It follows that if $A$ and $B$ are homogeneous functions of $\xi_1, \xi_2, \xi_3$ of degree $u$, their 
scalar product according to (5) is 

$$(AB) = (2\pi)^{-3}\,i^{-u}\iint A(\xi_1\xi_2\xi_3)\,B(\xi_1'\xi_2'\xi_3')\,e^{i(\xi_1\xi_1'+\xi_2\xi_2'+\xi_3\xi_3')}\,d\xi_1\,d\xi_1'\,d\xi_2\,d\xi_2'\,d\xi_3\,d\xi_3'. \tag{36}$$





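As an application of this rule, take 

$$A = (\xi_1^2+\xi_2^2+\xi_3^2)^r\,S_{u-2r}, \qquad B = (\xi_1^2+\xi_2^2+\xi_3^2)^s\,S_{u-2s},$$

where the $S$'s are solid harmonic functions. Then, using three-dimensional vector 
notation, (36) gives 

$$(AB) = (2\pi)^{-3}\,i^{-u}\iint \xi^{2r}S_{u-2r}(\boldsymbol{\xi})\,\xi'^{\,2s}S_{u-2s}(\boldsymbol{\xi}')\,e^{i(\boldsymbol{\xi}\boldsymbol{\xi}')}\,d\xi_1\,d\xi_2\,d\xi_3\,d\xi_1'\,d\xi_2'\,d\xi_3'. \tag{37}$$

From Green’s theorem 

$$\int\Big\{\xi^{2r}S_{u-2r}(\boldsymbol{\xi})\,\nabla^2 e^{i(\boldsymbol{\xi}\boldsymbol{\xi}')} - e^{i(\boldsymbol{\xi}\boldsymbol{\xi}')}\,\nabla^2\big[\xi^{2r}S_{u-2r}(\boldsymbol{\xi})\big]\Big\}\,d\xi_1\,d\xi_2\,d\xi_3$$

equals a surface integral of an oscillating kind which is to be counted as vanishing 
at infinity. This result reduces to 

$$4r(u-r+\tfrac12)\int \xi^{2r-2}S_{u-2r}(\boldsymbol{\xi})\,e^{i(\boldsymbol{\xi}\boldsymbol{\xi}')}\,d\xi_1\,d\xi_2\,d\xi_3 + \xi'^{\,2}\int \xi^{2r}S_{u-2r}(\boldsymbol{\xi})\,e^{i(\boldsymbol{\xi}\boldsymbol{\xi}')}\,d\xi_1\,d\xi_2\,d\xi_3 = 0,$$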
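
The numerical factor $4r(u-r+\tfrac12)$ comes from the Laplacian of $\xi^{2r}$ times a solid harmonic of degree $u-2r$. The following sketch checks this identity by exact polynomial arithmetic for one example; the choice of harmonic $3\xi_1^2\xi_2 - \xi_2^3$, the value $r = 2$, and all function names are illustrative assumptions.

```python
def mul(p, q):
    """Product of two polynomials in three variables, stored as {(a, b, c): coeff}."""
    out = {}
    for (a, b, c), x in p.items():
        for (d, e, f), y in q.items():
            key = (a + d, b + e, c + f)
            out[key] = out.get(key, 0) + x * y
    return {k: v for k, v in out.items() if v != 0}

def laplacian(p):
    """Sum of the three second derivatives of a polynomial."""
    out = {}
    for exps, x in p.items():
        for i in range(3):
            e = exps[i]
            if e >= 2:
                key = tuple(ej - 2 if j == i else ej for j, ej in enumerate(exps))
                out[key] = out.get(key, 0) + e * (e - 1) * x
    return {k: v for k, v in out.items() if v != 0}

def scale(p, c):
    return {k: c * v for k, v in p.items() if c * v != 0}

RHO2 = {(2, 0, 0): 1, (0, 2, 0): 1, (0, 0, 2): 1}   # xi_1^2 + xi_2^2 + xi_3^2

def rho_power(r):
    out = {(0, 0, 0): 1}
    for _ in range(r):
        out = mul(out, RHO2)
    return out

S = {(2, 1, 0): 3, (0, 3, 0): -1}   # 3 xi_1^2 xi_2 - xi_2^3, harmonic of degree 3
r, k = 2, 3
u = k + 2 * r                       # total degree of xi^(2r) S
lhs = laplacian(mul(rho_power(r), S))
# 4 r (u - r + 1/2) written as the integer 2 r (2u - 2r + 1)
rhs = scale(mul(rho_power(r - 1), S), 2 * r * (2 * u - 2 * r + 1))
print(lhs == rhs)
```

The same check may be repeated with any other solid harmonic, for which `laplacian(S)` is empty.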

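so (37) becomes, for $r, s > 0$, 

$$(AB) = (2\pi)^{-3}\,i^{-u+2}\,4r(u-r+\tfrac12)\iint \xi^{2r-2}S_{u-2r}(\boldsymbol{\xi})\,\xi'^{\,2s-2}S_{u-2s}(\boldsymbol{\xi}')\,e^{i(\boldsymbol{\xi}\boldsymbol{\xi}')}\,d\xi_1\,d\xi_2\,d\xi_3\,d\xi_1'\,d\xi_2'\,d\xi_3'. \tag{38}$$

Now suppose $s \ge r$ and apply the procedure by which (37) was changed to (38) $r$ 
times. The result is 

$$(AB) = (2\pi)^{-3}\,i^{-u+2r}\,4^r\,r!\,(u-r+\tfrac12)!\,(u-2r+\tfrac12)!^{-1}\iint S_{u-2r}(\boldsymbol{\xi})\,\xi'^{\,2(s-r)}S_{u-2s}(\boldsymbol{\xi}')\,e^{i(\boldsymbol{\xi}\boldsymbol{\xi}')}\,d\xi_1\,d\xi_2\,d\xi_3\,d\xi_1'\,d\xi_2'\,d\xi_3', \tag{39}$$

where $n!$ means $\Gamma(n+1)$ for $n$ not an integer. If $s > r$, the procedure can be applied 
once more, and then shows that 

$$(AB) = 0 \quad \text{for } r \ne s. \tag{40}$$

If $s = r$, (39) shows that 

$$(AB) = c\,4^r\,r!\,(u-r+\tfrac12)!, \tag{41}$$

where $c$ depends only on $u-2r$ and on the two $S$ functions. 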


Reference 

Dirac, P. A. M. 1942 Proc. Roy. Soc. A, 180, 1-40. 






An investigation of the performance of the Rayleigh disk 
By R. A. Scott, Research Department, Metropolitan Vickers Electrical Co. Ltd. 
(Communicated by Professor D. R. Hartree, F.R.S.—Received 2 May 1944) 

The paper describes an investigation of the nature of the law which governs the torque on a 
Rayleigh disk suspended in a sound field. The torque is found to follow an expression of the 
form obtained theoretically by Konig, provided that factors such as the finite thickness of 
the disk and the lack of infinite inertia of the disk are taken into account. The measurements 
were made at frequencies spread over the range 250-4000 cyc./sec. The correction to the 
torque in Konig’s formula made necessary by the thickness of the disk is shown to be large 
and to have a sign opposite to that determined theoretically by Konig for ellipsoidal disks. 
The influence upon the torque of the proximity of the wall of a tube has been investigated. 
The nature of the precautions which should be observed in choosing the form of a Rayleigh 
disk are discussed with reference to reducing to a minimum the effects on the torque of the 
thickness and mobility of the disk and of diffraction of sound by the disk. 

An investigation has been made of the numerical factor which relates the magnitudes of 
the torque to the other variables of Konig’s formula. This part of the investigation reduces 
essentially to the measurement of the oscillatory velocity of the field in which the disk is 
situated. The smoke-particle method of Andrade and of Carrière was adapted for the above 
purpose, and it was found possible to make determinations of oscillatory velocity within 
about 1 % for frequencies up to 4000 cyc./sec. 

Measurements have been made of the torque on a Rayleigh disk of known dimensions in 
terms of the oscillatory velocity for frequencies in the range 250-4000 cyc./sec. 
Discrepancies of 3·5 % in velocity (7 % in torque) are shown to follow from the use of Konig’s 
formula, and these results confirm, and extend into the audio-frequency range, the 
investigations of Merrington & Oatley in the band 9-22 cyc./sec. 

The experimental work confirms that the stability of behaviour of the Rayleigh disk 
justifies its continued use as a reference standard of acoustical intensity but shows that in 
setting up the reference standard, corrections amounting in extreme cases to 0·3 db. must 
be applied to the intensity as determined from Konig’s formula. 


1. Introduction 

The Rayleigh disk has long received attention as a means of providing a standard of 
sound intensity. Its simplicity of construction, and the ease with which it can be 
used under suitable experimental conditions, alike make it attractive and have led 
to its common use as a standard against which to calibrate microphones. Rayleigh 
(1880, 1882) showed that a torque is exerted on a thin circular disk when immersed 
in a flowing medium. The torque tends to turn the disk at right angles to the direction 
of flow. Konig (1891) determined mathematically the torque on such a disk on the 
assumption that the flow is streamline and steady. He found that an infinitely thin 
disk of radius $a$, when set so that its normal makes an angle $\theta$ with the direction of 
flow, should experience a torque given by the expression 

$$L = \tfrac43\,\rho a^3V^2\sin 2\theta, \tag{1}$$

where $\rho$ is the density of the medium (air) and $V$ is the velocity of the flow. Konig 
showed that the effect of viscosity of the medium is, to a first approximation, to 
produce a thrust and not a torque. The expression (1) shows that the torque remains 
unchanged in sign when the velocity of the medium is reversed, and it is for this 
reason that the Rayleigh disk finds application in acoustics. 
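
The dependence of the torque on the square of the velocity may be illustrated by a short calculation. The sketch below evaluates the Konig expression in the form $L = \tfrac43\rho a^3V^2\sin 2\theta$; the air density, disk radius and velocity are assumed illustrative values in SI units and are not taken from the paper, and the function name `konig_torque` is hypothetical.

```python
from math import sin, radians, isclose

def konig_torque(rho, a, V, theta_deg):
    """Torque L = (4/3) rho a^3 V^2 sin(2 theta) on an infinitely thin disk."""
    return (4.0 / 3.0) * rho * a**3 * V**2 * sin(2 * radians(theta_deg))

rho_air = 1.2    # kg/m^3, assumed illustrative value
a = 3.0e-3       # disk radius in metres, assumed illustrative value

forward = konig_torque(rho_air, a, 0.05, 45.0)
reverse = konig_torque(rho_air, a, -0.05, 45.0)
print(forward, reverse)   # the same torque for either direction of flow
```

Because the sign of the torque is independent of the direction of flow, an alternating velocity gives a steady mean torque, the greatest value occurring at 45°.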

When a disk is placed in a field of oscillatory velocity, $L$ and $V$ must be taken to 
stand for the mean value of the torque and the root-mean-square velocity 
respectively, and Konig’s equation in this form serves as a basis of acoustical measurement. 
Various investigators (e.g. Zernov 1908; Barnes & West 1927) have examined the 
applicability of Konig’s formula to acoustic fields, but almost all of the work has 
been carried out at the very lowest end of the audio-frequency range. Until recently 
no serious fault has been found with the formula, apart from small corrections which 
have been necessary for the effect of the finite thickness and the lack of complete 
immobility of the disk. Merrington & Oatley (1939) observed, however, whilst 
working at very low frequency (9-22 cyc./sec.), a discrepancy of 10 % between the 
torque on the disk and that deduced from Konig’s formula. This discrepancy was 
attributed to such steady circulations of air about the disk as are under given 
conditions to be found around obstacles in a sound field (Andrade 1931). Merrington & 
Oatley demonstrated that circulations of the appropriate type are to be found about 
the Rayleigh disk under the conditions of their experiments. 

The discovery that the torque on a Rayleigh disk as calculated from Konig’s 
formula can be in error at low frequency by as much as 10 % throws doubt on 
calibrations made at all frequencies. It is therefore important to investigate the accuracy 
of Konig’s formula over the entire range of audio-frequency. In the present paper 
the behaviour of the Rayleigh disk over the range of frequency 250-4250 cyc./sec. 
is investigated. The general form of the expression for the torque as found 
experimentally is described in the first part of the work, whilst in the second part an 
account is given of an experimental comparison between oscillatory velocity as 
determined on the one hand by application of Konig’s formula to the Rayleigh disk, 
and on the other hand from the excursions of smoke particles entrained in the sound 
field. 


2. Determination of the form of the law governing the torque on the Rayleigh disk 

Before Konig’s formula is applied to the case of an actual Rayleigh disk, the 
influence on the torque of the following factors, which were not adequately dealt 
with in Konig’s analysis, must be investigated: 

(a) The finite thickness of the disk. 

(b) The lack of infinite inertia of the disk. 

(c) The diffraction of the sound field by the disk. 

(d) The restriction upon the streaming near the disk caused by the proximity of 
the wall of any enclosing tube. 

Each of these factors has in the past received some attention, but only (b) has been 
investigated satisfactorily. The corrections are by no means negligible, and were 
therefore investigated in detail. 





(a) Finite thickness of disk 

No theoretical expression has yet been obtained for the torque on a short cylinder 
immersed in a sound field. Konig’s paper includes a derivation of the following 
expression for the torque on an ellipsoid of circular principal section of radius $a$: 

$$L = \tfrac43\,\rho a^3V^2\sin 2\theta\,(1 - 0{\cdot}30x), \tag{2}$$

provided that $x$, the ratio of thickness to diameter, is much less than unity. This 
expression implies that an increase in thickness of the disk leads to a decrease in 
torque on the disk for disks of ellipsoidal shape. Zernov (1908) has found 
experimentally that the torque on disks of short cylindrical form is given by 

$$L = L_0\,(1 + 2{\cdot}78x - 9{\cdot}05x^2), \tag{3}$$

where $L_0 = \tfrac43\,\rho a^3V^2\sin 2\theta$ and $x$ is the ratio of thickness to diameter of the disk. 
The measurements were made on a small number of disks at a frequency of about 
90 cyc./sec. This expression indicates that an increase in torque accompanies an 
increase in thickness of the disk, provided that $x$ is not very large, and this is at 
variance with Konig’s theoretical result for ellipsoids. It is clear that the correction 
for finite thickness of the disk requires further investigation. 
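
The opposite signs of the two thickness corrections are readily seen by tabulating them. The sketch below evaluates the factor $(1 - 0{\cdot}30x)$ for ellipsoids and the empirical factor $(1 + 2{\cdot}78x - 9{\cdot}05x^2)$ for short cylinders; the chosen values of $x$ and the function names are illustrative.

```python
def konig_ellipsoid_factor(x):
    """Factor (1 - 0.30 x) for an ellipsoid, x = thickness/diameter, x << 1."""
    return 1.0 - 0.30 * x

def zernov_cylinder_factor(x):
    """Empirical factor (1 + 2.78 x - 9.05 x^2) for a short cylindrical disk."""
    return 1.0 + 2.78 * x - 9.05 * x * x

for x in (0.01, 0.02, 0.05):
    print(x, konig_ellipsoid_factor(x), zernov_cylinder_factor(x))

# thickness ratio at which the empirical factor stops increasing
x_turn = 2.78 / (2 * 9.05)
print(x_turn)
```

On these figures the empirical factor rises with thickness up to a ratio of about 0·15, which is one reading of the proviso that $x$ be not very large.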

(b) Lack of infinite inertia of the disk, and 
(c) Diffraction of sound by the disk 

A. B. Wood showed in 1935 that a Rayleigh disk immersed in an accelerating 
medium follows to some extent the motion of the medium. This motion of the disk 
leads to a reduction in relative velocity between the disk and the medium and 
therefore to a decreased torque on the disk. Wood has shown that because of 
movement of the disk, the torque on a light disk used under water may be reduced as much 
as 2500-fold from the Konig value. 

L. V. King (1935) has produced an excellent mathematical analysis of the 
behaviour of the Rayleigh disk and has paid particular attention to the effects of finite 
inertia of the disk and of diffraction of the sound wave by the disk. For a disk of 
radius $a$ placed at an antinode in a system of plane stationary waves, King finds 

r = t %[l + IW 2 CQS 2 g] 

°%+^ 0 [i-4W] } w 

provided that the disk has a diameter small compared with the wave-length of the 
sound. In this formula m x is the mass of the disk less the mass of the medium dis¬ 
placed by the disk, m 0 is the hydrodynamic mass of the disk (f pa 3 ), and k is 2n times 
the reciprocal of the wave-length. At low frequency, where ka is much less than unity, 
the correction factor reduces, for 6 = 45°, to nt 1 l(m 1 + m 0 ) 3 and this is in agreement 
with Wood’s expression for the effect and also with the results of his experiments in 
water. There can be little doubt that this correction should be applied to all experi¬ 
mental values of torque on the disk. 
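
As a rough illustration of the size of the low-frequency inertia correction, the factor m1/(m1 + m0) can be evaluated for a thin glass disk in air. This is only a sketch: the densities of air and glass below are assumed round values, not figures from the paper, and the dimensions are those listed for disk A in table 1.

```python
import math

# Assumed round values (not from the paper): densities in g/cm^3.
RHO_AIR = 1.2e-3
RHO_GLASS = 2.5


def inertia_factor(diameter_cm, thickness_cm, rho_disk, rho_medium=RHO_AIR):
    """Low-frequency factor m1 / (m1 + m0) for a cylindrical disk.

    m1: mass of the disk less the mass of medium it displaces.
    m0: hydrodynamic mass of the disk, (8/3) * rho * a^3.
    """
    a = diameter_cm / 2.0
    volume = math.pi * a ** 2 * thickness_cm
    m1 = (rho_disk - rho_medium) * volume
    m0 = (8.0 / 3.0) * rho_medium * a ** 3
    return m1 / (m1 + m0)


# Disk A of table 1: glass, 0.6284 cm diameter, 0.0162 cm thick.
reduction_pct = 100.0 * (1.0 - inertia_factor(0.6284, 0.0162, RHO_GLASS))

if __name__ == "__main__":
    print(f"torque reduction for disk A: {reduction_pct:.2f} %")
```

With these assumed densities the reduction comes out a little under 0.8 %, close to the 0.77 % listed for disk A in table 1; in water the same disk would have m0 comparable with m1 and the correction would be very much larger.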





An investigation of the performance of the Rayleigh disk 

For frequencies at which $ka$ approaches unity the correction factor becomes, for
$\theta = 45°$, approximately $1 + \tfrac{1}{5}(ka)^{2}$. This diffraction correction amounts in
air to about 1 % of the torque for a disk of ¼ in. diameter at 3800 cyc./sec., i.e. for
$ka = 0.22$. The correction is therefore important only at the higher end of the
audio-frequency range.
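
The figures quoted for the diffraction correction can be checked with a few lines of arithmetic. The sketch below assumes a velocity of sound of 343 m./sec., reads the disk diameter as ¼ in., and takes the 45° correction factor as 1 + (ka)²/5; the last two are readings of a damaged passage, chosen because they reproduce the quoted ka of 0.22 and the correction of about 1 %.

```python
import math

C_AIR_CM_S = 34300.0  # assumed velocity of sound in air, cm/sec


def ka(diameter_cm, frequency_hz, c=C_AIR_CM_S):
    """Product of wave number k = 2*pi*f/c and disk radius a."""
    return 2.0 * math.pi * frequency_hz / c * (diameter_cm / 2.0)


disk_diameter_cm = 0.25 * 2.54           # a quarter-inch disk (assumed reading)
ka_value = ka(disk_diameter_cm, 3800.0)  # about 0.22
diffraction_factor = 1.0 + ka_value ** 2 / 5.0  # assumed form of the 45-degree factor

if __name__ == "__main__":
    print(f"ka = {ka_value:.3f}, factor = {diffraction_factor:.4f}")
```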

In most investigations, including our own, in which the Rayleigh disk is used,
care is taken to work only with disks of diameter small compared with the
wave-length. The smallness of the correction would seem to justify, at least until further
evidence is available, the application of King's theoretical correction, since this
rests on as firm a basis as does the general form of König's expression for the primary
torque. Barnes & West, studying the behaviour of Rayleigh disks at high frequency,
find that discrepancies of more than & % do not appear between the actual torque
and that given by a formula of the same type as König's, provided that the ratio of
disk diameter to wave-length is less than 0.2.

(d) Restriction upon the streaming near the disk, caused by the
proximity of the walls of the enclosing tube

Tomotika (1937) has recently calculated the effect of enclosing walls upon the
torque in the case of the two-dimensional analogue of the Rayleigh disk and finds
the torque to be increased in the factor

\[ (1 + 0.411z^{2} + 0.247z^{4} + 0.120z^{6} + \ldots), \tag{5} \]

where $z$ = (width of disk)/(width of channel), and where $\theta = 45°$. His calculation
assumes the medium to be incompressible and therefore ignores wave effects. His
paper includes results of measurements on disks made by Sato in support of the
calculations. The brevity of the account makes appraisement of the observations
difficult. These experimental results are in qualitative agreement with the
two-dimensional theory, but show a substantial dependence on frequency.

Zernov (loc. cit.) has found experimentally that in the presence of an enclosing
tube the torque is increased in the ratio

\[ (1 + 1.2z^{2} - 5.8z^{4}) : 1, \tag{6} \]

where $z$ = (diameter of disk)/(diameter of tube). However, only a small range of $z$
was covered in the work, and the maximum value of the correction was 6.1 %,
corresponding to $z = 0.3$. Barnes & West (1927), working with disks of various sizes,
have observed no marked effect on the torque due to the presence of the walls. It
is evident that further investigation of this correction is desirable.
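
The two wall corrections quoted above are easily compared numerically. The following sketch simply evaluates expressions (5) and (6) as printed (the series (5) truncated after the z⁶ term); the function names are illustrative, and it should be remembered that z means a ratio of widths in Tomotika's two-dimensional case and a ratio of diameters in Zernov's.

```python
# Sketch: evaluate the published wall-proximity factors (5) and (6).
# Function names are hypothetical; coefficients are those quoted in the text.

def tomotika_factor(z):
    """Two-dimensional theory, theta = 45 deg, series truncated at z^6."""
    return 1.0 + 0.411 * z ** 2 + 0.247 * z ** 4 + 0.120 * z ** 6


def zernov_wall_factor(z):
    """Zernov's empirical ratio for a disk in a circular tube."""
    return 1.0 + 1.2 * z ** 2 - 5.8 * z ** 4


if __name__ == "__main__":
    for z in (0.1, 0.2, 0.3):
        print(f"z = {z:.1f}  Tomotika: {tomotika_factor(z):.4f}"
              f"  Zernov: {zernov_wall_factor(z):.4f}")
```

At z = 0.3 Zernov's expression gives an increase of about 6.1 %, in agreement with the maximum correction stated in the text, while the two-dimensional series gives rather less.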

It is appropriate here to draw attention to a factor which has been reported to 
cause, on occasion, erratic behaviour of the disk (Barnes & West, 1927 ). When the 
frequency of the sound is close to a resonant frequency of flexural vibration of the 
disk, the vibrations of the disk modify the sound field and lead to anomalous 
behaviour. The anomalies are limited, however, to a small range of frequency close 
to the resonant frequency of the disk. 





In order to determine with more certainty the influence on the torque of the
factors enumerated above, especially in the range of frequency 250-4000 cyc./sec.,
measurements were made by the author to determine how the torque on the disk
varies as changes are made in the dimensions and disposition of the disk and in the
intensity and frequency of the sound.


3. Apparatus

(a) Resonant tube and the Rayleigh disk

Investigation at audio-frequencies of the torque on Rayleigh disks must be made
in a draught-free space and in an acoustic field of known direction and of good
uniformity. The most convenient way of providing these conditions is to immerse
the disk in the sound field in a resonant tube, such as can be made by closing one
end of a tube with a solid reflector and placing a source of sound at the other end.
Rayleigh has shown that the wave front inside a long tube tends to become plane
and normal to the axis provided that the ratio of the diameter of the tube to the
wave-length is less than 0.59. Experimental explorations of the field in a closed-end
resonant tube have been made by the author for several tubes with the aid of a
microphone which receives the sound via a narrow probe-tube (0.159 cm. bore,
0.079 cm. wall) and it has been found that fields which are to a close approximation
plane-stationary can be obtained up to frequencies indicated by the Rayleigh
criterion. Marked deviations from the plane-stationary field do not occur until
roughly twice this frequency is reached, provided that the attenuation along the
tube is not too great.
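
The Rayleigh criterion can be turned into an upper working frequency for a tube of given bore. The sketch below assumes a velocity of sound of 343 m./sec. and applies the limit diameter/wave-length = 0.59 to the two tube diameters used in this work (4.440 cm. and 12.70 cm.); the function name is illustrative.

```python
C_AIR_CM_S = 34300.0  # assumed velocity of sound in air, cm/sec


def rayleigh_limit_hz(tube_diameter_cm, c=C_AIR_CM_S, ratio=0.59):
    """Frequency at which tube diameter / wave-length reaches the given ratio."""
    return ratio * c / tube_diameter_cm


f_small_tube = rayleigh_limit_hz(4.440)   # roughly 4.6 kcyc./sec.
f_large_tube = rayleigh_limit_hz(12.70)   # roughly 1.6 kcyc./sec.

if __name__ == "__main__":
    print(f"4.440 cm tube: {f_small_tube:.0f} cyc./sec.")
    print(f"12.70 cm tube: {f_large_tube:.0f} cyc./sec.")
```

On this estimate the smaller tube satisfies the criterion up to about 4.6 kcyc./sec., and the larger tube only up to about 1.6 kcyc./sec.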

Subject to the limitations discussed above, the acoustic field in a closed tube
consists of plane, stationary waves in which velocity antinodes are spaced at odd
multiples of $\tfrac{1}{4}\lambda$ from the reflector end of the tube. When the tube is in effect closed at
both ends and in resonance the total length of the tube is some multiple of the half
wave-length (apart from end corrections which are usually small). At all frequencies
for which the tube is an odd number of half wave-lengths long, the reflector is at a
pressure maximum and the midsection of the length of the tube at a velocity
maximum. In the region of the velocity maximum, the velocity is axial and substantially
uniform. The midsection of the tube is therefore a suitable place at which to use a
Rayleigh disk. The reflector at the end of the tube consists of the flat diaphragm of
a microphone. Neglecting dissipation in the tube,* it can be shown that the r.m.s.
pressure $p$ at a node in the tube is related to the r.m.s. velocity $V$ at an antinode by
the expression

\[ p = \rho c V, \tag{7} \]

where $\rho$ is the density of the air and $c$ is the velocity of sound. The e.m.f. from the
microphone and the r.m.s. velocity at an antinode are therefore related in magnitude
by a constant factor for sound of fixed frequency. Fields of definite acoustic velocity
can be maintained in the tube in terms of the voltage produced at the microphone,
and measurements can be made on a series of Rayleigh disks to determine the
relations between the dimensions of a Rayleigh disk, the torque and the particle
velocity. It should be observed that the construction of such microphones as are
suitable for precise measurement is too complicated to allow of direct calculation
of velocity at an antinode from the voltage at the microphone. Thus, for example,
the resistance and the effective stiffness associated with the diaphragm of a
condenser microphone are critically dependent on the motion of the air in the gap
between the diaphragm and back plate, and this motion cannot be satisfactorily
determined.

* This approximation involves an error which is less than 1 % of the pressure for any of
the measurements described later in this paper.



The tube (figure 1) used in the greater part of the experimental work is 68.4 cm.
long and of internal diameter 4.440 cm. The wall of the tube is of brass 0.318 cm.
thick and the bore is smooth and closely circular in section. The source of sound
consists of a moving-coil loudspeaker unit of the diaphragm type such as is used to
drive large exponential horns. This type of driver was chosen because it is capable
of transmitting considerable acoustic power to the tube without introducing much
harmonic distortion. The unit connects by rubber tube with a hole 1.58 cm. diameter
in the centre of a brass plate 0.635 cm. thick which closes the driving end of the tube.
The condenser microphone at the other end is fitted with a diaphragm flush with the
front of its casing except for a slightly sloping zone at the edge of the diaphragm.
The microphone is otherwise of conventional type. Such microphones have been in
use over a considerable number of years in this laboratory and have been found to
possess very stable pressure sensitivities. The acoustic impedance of the diaphragm
is high compared with the characteristic acoustic impedance of the air, and it can
be shown that the reflexion at the microphone end of the tube is substantially
complete.





A range of Rayleigh disks was constructed to meet the requirements of the
experimental work, and details of the disks are listed in table 1. The disks were either
ground or turned so as to have edges smooth and square with the faces. The Rayleigh
disk in use was supported at the centre of the tube on a fused-silica suspension 36 cm.
long and approximately 6 μ diameter. The suspension was calibrated by means of
circular copper inertia disks which were supported by the suspension and allowed
to oscillate in a horizontal plane. The measurement of torsion constant as determined
for a suspension from two different inertia disks gave agreement to within about
0.2 %. The suspension was mounted from a torsion head supported by a column
attached to the tube and entered the tube through a hole 0.16 cm. square in the wall.
The torque on the disk was measured in terms of the twist of the torsion head
required to return the disk to a standard position in the field. The direction of the disk
relative to the tube was found from the deflexion of a beam of light reflected from the
silvered or polished surface of the disk. The light entered the tube through a thin
glass window, 0.95 cm. diameter and 0.015 cm. thick, which was recessed in the
wall of the tube so as to affect as little as possible the continuity of the inner wall
and the area of cross-section of the tube.


Table 1. Disks used in the experimental work

                                maximum                  Wood-King
                                variation                correction to      resonant frequency
                       mean     in diam.                 torque for lack       (kcyc./sec.)
                       diam.    from mean     thickness  of inertia of      2 nodal     1 nodal
disk   material        (cm.)    (cm. x 10^-4) (cm.)      disk (%)           diameters   circle

A      glass           0.6284   15            0.0162     0.77               21.6        -
A₁     ,,              0.6335   18            0.0162     0.71               21.2        -
B      ,,              0.9942   10            0.0142     1.41               10.1        15.9
C      ,,              1.0950   15            0.0183     1.18                8.0        12.5
D₁     ,,              1.1582   10            0.0211     1.13                8.2        12.9
E      ,,              1.9143   20            0.0122     3.22                1.7         2.7
F      ,,              2.5959    8            0.0191     2.71                1.5         2.3
D₂     ,,              1.1585   10            0.0135     1.69                5.2         8.2
D₃     ,,              1.1580   10            0.0343     0.77               13.4        21.0
D₄     ,,              1.1595   30            0.0737     0.32               28.4        -
D₅     ,,              1.1608   20            0.1219     0.19               47.2        -
D₆     copper          1.1618   56            0.0200     0.34                5.6         8.8
D₇     molybdenum      1.1585    5            0.0064     0.89                2.5         3.9
D₈     ,,              1.1570   15            0.0064     0.89                2.5         3.9



The electrical apparatus used in the experimental work needs little description
here. The driving unit was supplied with energy from a push-pull oscillator and
amplifier capable of providing to the unit an output of about 5 W. The e.m.f.
produced at the microphone was amplified by a multi-stage amplifier and passed
through a tuned-circuit analyser (vide Churcher & King 1934) to a rectifier
moving-coil voltmeter, and the overall gain of the apparatus was adjusted to give a convenient
deflexion of the meter. The input voltage necessary to produce an equal deflexion
was then found from an accurately calibrated potentiometer supplied with voltage
of the working frequency from the oscillator.

Measurements of frequency were made in terms of a Sullivan valve-maintained
tuning fork which had been standardized at the N.P.L. and gave a frequency of
1000.0 cyc./sec. Comparisons of the working frequency and the standard frequency
were made with a Cossor double-beam cathode-ray oscilloscope which was
connected so that the two sets of plates for vertical deflexion were controlled from the
separate supplies. Appropriate adjustment of the time-base makes it possible to
obtain a pattern on the screen consisting of two sinusoidal traces moving slowly
relatively to each other and containing fewer, and usually many fewer, than 30 cycles.
Measurement of the slip-rate and of the number of traces in each pattern makes it
easy to find the frequency of the oscillator to well within 0.1 % at all the frequencies
used in the experimental work.

The entire electrical apparatus is particularly stable, and it is possible to rely on
the constancy of indication of the meter over a period of some minutes to as close
an accuracy (about 0.2 %) as it is possible to read the meter.

The bulk of the work on the first part of the investigation involves comparison 
of torque on different disks when placed at the centre of the resonant tube, and little 
further information is necessary about the precise nature of the sound field beyond 
the assurance that the disk is in a field of uniform axial velocity. However, the 
comparison of torque on disks in different tubes (see § 4 (b) to follow), depends not 
only on the uniformity of the field, but also upon the proper location of the antinodes 
in the tube. With the probe-microphone referred to in §3 (a), explorations of 
pressure were made along and across the tube and these confirmed that the field 
conditions within the tube were satisfactory. Further reference to these field 
measurements will be made only where necessary. 

4. Results 

In the experimental work described below, the disk was set so that its normal 
was at 45° with the axis of the tube, except in the case of the determination of 
variation of torque with angle. 

(a) Variation of torque with intensity of sound 

According to König the torque on a Rayleigh disk should vary as the square of
the oscillatory velocity and therefore as the square of the voltage from the
microphone. The relation, for disk A of table 1, between torque and voltage was found at
frequencies of 250, 750, 1250, 4250, and 8250 cyc./sec. over the range of oscillatory
velocity 0.13 to 25 cm./sec. (r.m.s.), i.e. intensity levels from 89 to 134 db. or a range
of torque of about 37,000:1. The graphs in figure 2 show the ratio (torque)/(square
of voltage from microphone) plotted against the oscillatory velocity for frequencies
of 750 and 4250 cyc./sec.; the observations at the higher intensities (crosses) were
made with a thicker suspension. The curve for 750 cyc./sec. shows that the above
ratio is constant to within about 1 % in torque over the whole range of intensity.
Similar results were obtained at all the frequencies used except at 4250 cyc./sec.
In the latter case the graph shows slight curvature, and the departures from
constancy in the ratio may be regarded as significant. This small discrepancy is,
however, only % in torque (i.e. a little over 1 % in velocity, if considered in terms of
the acoustic field). The substantial constancy of the ratio torque/(voltage)² is
powerful evidence of (a) proportionality between voltage from microphone and
acoustic pressure at its diaphragm, and (b) proportionality between torque on the
disk and the square of the acoustic velocity at the centre of the tube.
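
The quoted intensity levels and range of torque follow from the range of oscillatory velocity, as the sketch below suggests. It assumes the plane-wave relation p = ρcV between r.m.s. pressure and r.m.s. velocity, a characteristic impedance ρc of 41.5 c.g.s. units for air, and a reference pressure of 0.0002 dyne/cm.²; these constants are assumptions, not values given in the paper.

```python
import math

RHO_C = 41.5      # assumed characteristic impedance of air, g cm^-2 sec^-1
P_REF = 2.0e-4    # assumed reference r.m.s. pressure, dyne/cm^2


def level_db(v_rms_cm_s):
    """Intensity level of a plane wave of given r.m.s. particle velocity."""
    p_rms = RHO_C * v_rms_cm_s          # p = rho * c * V
    return 20.0 * math.log10(p_rms / P_REF)


low_level = level_db(0.13)              # close to 89 db.
high_level = level_db(25.0)             # close to 134 db.
torque_range = (25.0 / 0.13) ** 2       # torque varies as V^2

if __name__ == "__main__":
    print(f"{low_level:.1f} db. to {high_level:.1f} db., "
          f"torque range {torque_range:.0f}:1")
```

Since the torque varies as the square of the velocity, a velocity range of 0.13 to 25 cm./sec. corresponds to a torque range of nearly 37,000:1, as stated.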



Figure 2. Variation of torque with intensity of sound. 

O light suspension. x heavy suspension. 

In the remainder of the work the above relations are assumed and the square of 
the voltage from the microphone has been used to characterize the intensity of the 
acoustic field at the centre of the resonant tube. In the work described below, it is 
required to find the manner of variation of the torque with changes of other variables 
for a disk in a field of fixed intensity; for this purpose, measurements of torque were 
made for a series of intensities for each experimental condition and the mean of
the ratios torque/(voltage)² is used for the comparisons.

(b) Effect on torque of proximity of wall of tube

Comparisons have been made of torque/(voltage)² for various disks in each of
two tubes. The first tube is that already described and has a diameter of 4.440 cm.;
the second tube is also of brass and of similar construction and has a diameter of
12.70 cm. and a length which can be varied up to 300 cm. The latter tube is closed
at its ends by heavy pistons into whose faces the microphone and the end-piece
from the smaller tube can be fitted. The tube has a central suspension column, and
the windows and fittings of each of the tubes are arranged so that the driving unit,
the microphone and the disk and suspension can be transferred easily from one tube
to the other. The length of the larger tube was adjusted so that the tubes had a
common resonant frequency. Observations were made with the aid of the probe
microphone to confirm that the acoustic field was uniform and properly located with
respect to the centres of the tubes. Measurements were then made of the torque
on the disks A, E and F of table 1 in each of the two tubes for frequencies 740 and
1250 cyc./sec. For each disk, the difference between the two values of
torque/(voltage)² provides a chord of the curve which relates torque/(torque in free space)
to the ratio (diameter of disk)/(diameter of tube). From the chords the shape of the
whole curve has been synthesized. The sets of data obtained are in reasonable
agreement for the two frequencies and a single curve (a of figure 3) has been drawn to
summarize the results. Attempts to determine the shape of the curve for a frequency
of 2750 cyc./sec. were unsatisfactory, since measurements of pressure showed
variations across the tube of rather more than 1 %, i.e. equivalent to more than 2 %
in torque. Such uncertainty in the field is large enough to swamp the comparatively
small effect of the wall of the tube on the torque. At frequencies higher than
2750 cyc./sec. the field variations, and therefore the uncertainties, were larger.
Curve b in figure 3 shows Tomotika's theoretical results for the two-dimensional
analogue of the Rayleigh disk.

Figure 3. Effect on torque of proximity of wall of tube.
(a) Experimental curve: × 740 cyc./sec.; + 1250 cyc./sec.
(b) Tomotika's two-dimensional analogue.





(c) Effect on torque of finite thickness of disk 

A series of disks (D₁ to D₈ in table 1), ranging in thickness from 0.0064 to 0.1219 cm.,
were made of closely similar diameter (1.159 cm.). Measurements were made of the
ratio of the torque on each disk to the torque on the disk D₁, of thickness 0.0211 cm.,
for the same oscillatory velocity and the results, suitably corrected with the aid of
König's formula for the small variations in diameter, are plotted against the ratio
(thickness of disk)/(diameter of disk) in figure 4. The thinner disks were made of
copper and molybdenum, since disks of glass would have been so light as to involve
large corrections for lack of infinite inertia of the disk. Small corrections for lack of
infinite inertia of disk were applied in accordance with King's formula. Agreement
between the two sets of results corresponding to 749 and 3272 cyc./sec. is sufficiently
good to allow a single curve to be drawn through the points to represent the manner
in which the torque on a Rayleigh disk depends upon its thickness. Curves are also
included in figure 4 to show the shape of this relation as determined theoretically
by König for ellipsoidal disks, and experimentally by Zernov for cylindrical disks.

Figure 4. Effect on torque of finite thickness of disks.
(a) Author's experimental curve for cylindrical disks.
(b) Zernov's experimental curve for cylindrical disks.
(c) König's theoretical curve for ellipsoidal disks.
○ 749 cyc./sec. + 3272 cyc./sec.

(d) Variation of torque with radius of disk

The relation between torque and radius was investigated for a number of glass disks
suspended in the small tube. Comparisons were made in quick succession between
torque/(voltage)² for disk A and for each of the disks B, C, D₁ and E. The torque was
corrected by King's formula for lack of infinite inertia of the disks and, by means of
figures 3 and 4 of this paper, for proximity of the wall of the tube and for finite
thickness of the disk. In figure 5 are plotted the logarithms of the ratios of the torques
on the comparison disk and the standard disk against the logarithms of the ratios
of the radii of the two disks. The results are shown for the two frequencies 750 and
3270 cyc./sec., and the solid line which is drawn through the set of points has a slope
of 3.00. The measurements show that the torque on the disk is closely proportional
to the cube of the radius of the disk.



Figure 5. Variation of torque with radius of disk.

The solid line has a slope of 3.00. ○ 750 cyc./sec. × 3270 cyc./sec.

(e) Variation of torque with angle between disk and stream

Measurements of the variation of torque with orientation of the disk relative to the
direction of the sound wave were made in the larger tube, since this was fitted with a
window subtending a large angle (75°) at the centre of the tube. The direction which
the particle velocity in the central section of the tube would have in the absence of
the disk was found from the setting of the torsion head for which the disk remained
undeflected in either direction after the sound was turned on in the tube. This
direction was found to coincide with that of the geometrical axis of the tube to well within 1°.
Measurements of torque/(voltage)² were made for various deflected positions of the
disk A for frequencies of 740, 1290 and 2900 cyc./sec., and this ratio is compared
in table 2 with the sine of twice the angle $\theta$ between the normal to the disk and the
direction of the stream.




Table 2. Variation of torque with inclination of disk to stream

Frequencies: 740, 1290 and 2900 cyc./sec.

                    740 cyc./sec.            1290 cyc./sec.           2900 cyc./sec.
 θ       sin 2θ     L/E²    L/(E² sin 2θ)    L/E²    L/(E² sin 2θ)    L/E²    L/(E² sin 2θ)

10       0.3420     3312    9684             594.5   1738             1162    3397
15       0.5000     4884    9768             868.5   1737             1695    3390
20       0.6428     6260    9740             1123    1747             2190    3407
25       0.7660     7486    9774             1343    1754             2614    3413
30       0.8660     8364    9659             1507    1740             2916    3367
35       0.9397     9043    9624             1622    1726             3175    3378
40       0.9848     9437    9583             1727    1754             3337    3389
45       1.0000     9517    9517             1756    1727             3384    3384
48.16    0.9939     -       -                1724    1734             -       -
49.24    0.9891     9297    9398             -       -                -       -

θ is angle between disk and wavefront; L is torque on disk and E is voltage from microphone
corresponding to sound giving torque L.
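
The constancy of the ratio L/(E² sin 2θ) in table 2 can be examined directly. The sketch below uses the 740 cyc./sec. column as read from the table (the angle printed after 25° being taken as 30°, consistent with its value of sin 2θ), recomputes the ratio at each angle and reports the greatest fractional departure from the mean. The variable names are illustrative.

```python
import math

# 740 cyc./sec. column of table 2: angle theta (degrees) and L/E^2 (arbitrary units).
theta_deg = [10, 15, 20, 25, 30, 35, 40, 45, 49.24]
l_over_e2 = [3312, 4884, 6260, 7486, 8364, 9043, 9437, 9517, 9297]

# If the torque varies as sin(2*theta), this ratio should be constant.
ratios = [value / math.sin(math.radians(2.0 * angle))
          for angle, value in zip(theta_deg, l_over_e2)]

mean_ratio = sum(ratios) / len(ratios)
max_departure = max(abs(r - mean_ratio) / mean_ratio for r in ratios)

if __name__ == "__main__":
    for angle, r in zip(theta_deg, ratios):
        print(f"theta = {angle:5.2f}  L/(E^2 sin 2theta) = {r:7.0f}")
    print(f"greatest departure from mean: {100 * max_departure:.1f} %")
```

On these figures the ratio stays within about 2.5 % of its mean while sin 2θ itself changes nearly threefold.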


The foregoing series of measurements of torque $L$ at an antinode in a
plane-stationary sound field on Rayleigh disks of widely differing dimensions has shown
that the torque varies closely as the square of the r.m.s. velocity $V$ and as the sine
of twice the angle $\theta$. The measurements show that for a very thin disk supported
rigidly in free space, the torque would be proportional to the cube of the radius $a$.
Thus the torque which would be exerted on an infinitely thin, immovable Rayleigh
disk is given by the expression

\[ L = L_1 a^{3}V^{2}\sin 2\theta. \tag{8} \]

$L_1$ is here the counterpart of the factor $\tfrac{4}{3}\rho$ in König's expression (1) and must have
the dimensions of a density. $L_1$ is constant* for air of fixed density. The relation (8)
holds over a wide range of frequency (250-5000 cyc./sec.) and such small
discrepancies as may accompany its application lead to errors which, judged by the
consistency of the experimental data, are unlikely to be much more than 1 % of
the torque.

In the remainder of the investigation the relation between $L_1$ and König's factor
of $\tfrac{4}{3}\rho$ has been found. For this purpose the determination of $L$, $a$ and $\theta$ presents little
difficulty and evaluation of the ratio reduces largely to an independent measurement
of oscillatory velocity.

* Records were kept of the pressure, temperature and humidity of the air, and suitable
small corrections were applied to the torque in proportion to the change in density of the air.


Evaluation of the numerical constant involved in the expression for torque

Determinations of sound particle velocity can be carried out without difficulty
by a number of methods for frequencies of 50 cyc./sec. and below (vide the work of
Zernov and others referred to earlier). At higher frequencies, and especially above
1000 cyc./sec., few methods of measurement exist, and such as have been used are
indirect and for the most part difficult to analyse with precision. The method adopted
by the present author depends on the determination of particle amplitude from the
length of the traces of smoke particles entrained in the sound field. This method has
been used by Carrière (1929), by Andrade (1931), and by Andrade & Parker
(1937), and approximates closely to direct measurement of one of the fundamental
acoustic quantities, viz. oscillatory amplitude. The validity of the method depends
on the extent to which the smoke particles take up the full motion of the medium
and this is susceptible to approximate calculation and, to some extent, to
experimental demonstration.

The theory of motion of a spherical particle immersed in an oscillating fluid has
been worked out by Stokes and others (see Andrade 1931). If $w_0$ and $v_0$ are the
amplitudes of motion of the sphere and of the surrounding medium (air), if $R$ is the
radius of the sphere and $\rho_0$ is its density, if $\nu$ is the kinematic viscosity and $\rho$ the
density of the medium and if $n$ is the frequency of the sound:

\[ \frac{w_0}{v_0} = \left[\frac{1 + 3b + \tfrac{9}{2}b^{2} + \tfrac{9}{2}b^{3} + \tfrac{9}{4}b^{4}}{a^{2} + 3ab + \tfrac{9}{2}b^{2} + \tfrac{9}{2}b^{3} + \tfrac{9}{4}b^{4}}\right]^{1/2}, \tag{9} \]

where $a = \tfrac{1}{3} + \tfrac{2}{3}\,\rho_0/\rho$ and $b = \dfrac{1}{R}\sqrt{\dfrac{\nu}{\pi n}}$.

The smoke particles were obtained by igniting
20 mg. of magnesium ribbon in a 2.5 l. flask. Andrade & Parker have estimated the
size of MgO particles produced in this general manner and find them to have an
effective radius of 0.33 μ. (Observations on smoke from our own apparatus have
kindly been made by the staff of the British Cotton Industry Research Association
on an R.C.A. electron microscope. The large bulk of the particles which are deposited
on the floor of a roughly cubical vessel containing about 20 c.c. of smoke during a
short period, about 20 min. after the smoke has been formed, are cubes with edges
between 0.1 and [...] μ long.) The ratio $w_0/v_0$ is calculated according to the formula (9)
above for spherical particles of radius 0.3 μ and the values are given in table 3.

Table 3. The smoke particle as an indicator of air particle amplitude
Calculation on the assumption of a MgO particle of radius 0.3 μ

frequency        ratio (amplitude of smoke particle)/
(cyc./sec.)            (amplitude of air particle)

  250                  0.99996
 1250                  0.99916
 2250                  0.9975
 3250                  0.9947
 4250                  0.9913
 5250                  0.9868

It seems reasonable to suppose that this calculation gives at least the right order
of magnitude for the ratio $w_0/v_0$, and that the particles follow the motion of the air
to within about 1 % for all frequencies below 4000 cyc./sec. Evidence of the smallness
of the discrepancy between the amplitudes of the smoke and the air particles was
provided by observing the amplitudes of oscillating MgO smoke particles which
had been standing in a still atmosphere for about 20 min. The particles, some of
which had become enlarged by coagulation, showed colours which ranged from blue
to red and must therefore have varied considerably in size. Nevertheless, no
discrepancy between the lengths of the traces of different particles was measurable,
even at frequencies of 5000 cyc./sec.

The smoke particle method appears to provide a standard of sound amplitude
which is adequately accurate for measurements up to 2000 or 3000 cyc./sec. and is
probably not in error by more than 1 or 2 % at frequencies of 5000 cyc./sec.

Apparatus 

The resonant tube of 4.440 cm. diameter, described earlier in the paper, was
adapted (cf. Andrade & Parker) by the provision of suitable windows and a
microscope tube, so that measurements could be made of the amplitude of smoke particle
traces at the section of the tube at which the Rayleigh disk was otherwise situated.
A cross-section of the tube is shown in figure 6. The particles were illuminated from
an arc-lamp (L) and a large-aperture projection lens (P). As a protection against
convection currents in the air in the tube, a water-bath (W) was interposed between
the arc and the tube, and the tube was covered with a thick layer of felt (F). A
composite microscope objective was constructed from two pieces: a 1 in. Davon
(O₁) of n.a. 0.26 and a 16 mm. Beck (O₂) of n.a. 0.28. The composite objective has a
focal length of about 2.5 cm., an n.a. of 0.11 and a sufficiently large working distance
(about 2.5 cm.) to allow measurements to be made of the particle amplitude at
various distances from the axis of the tube. Measurements of the length of traces
were made by fitting to the eyepiece a glass graticule consisting of two parallel lines.
The intensity of the sound in the tube was adjusted so that the ends of the particle
trace as seen through the microscope appeared just to touch the lines. The effective
separation of the lines in the object-space was determined by calibration against an
engraved scale which itself had been checked by the N.P.L. A series of graticules
was used with spacings which range, when referred to the object-space, from
15.3 to 251.9 μ.

Figure 6. Resonant tube fitted for smoke particle amplitude measurement.
L, arc; W, water bath; P, condenser lens; F, felt lagging;
O₁, O₂, microscope objectives; G, eyepiece graticule.

Results 

At each of the working frequencies, corresponding measurements of length of
trace and output voltage from the microphone were obtained and plotted for a
series of about six graticules of different spacing. In each case the traces of about
ten separate particles were measured.

[Figure 7. (a) 749 cyc./sec. (b) 3275 cyc./sec.]

Figure 7 shows typical curves of apparent
length of particle trace against voltage from the microphone for the frequencies
749 and 3275 cyc./sec. In each case the curves are straight lines which cut the length
axis at a distance from the origin of between 3 and 4 μ. The intercept represents the
effective diameter of the diffraction disk produced by the microscope and is in close
agreement with the theoretical value (3 μ) of the diameter given by $d = k \cdot 1.22\lambda/\text{n.a.}$,
where $\lambda$ is the mean value of the wave-length used (0.55 μ), and n.a. is the numerical
aperture of the objective system (0.11). The factor $k$, which depends on the visual
acuity and has a value of about 0.5, is the ratio of the apparent diameter of the disk
to the eye, to the diameter of the first minimum of the Airy diffraction pattern.
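
The theoretical diameter of the diffraction disk quoted above follows directly from the expression d = k . 1.22 λ/n.a. with the stated values. A minimal sketch, in which the function name is illustrative:

```python
def apparent_disk_diameter_um(wavelength_um=0.55, numerical_aperture=0.11, k=0.5):
    """Apparent diameter of the diffraction disk, d = k * 1.22 * lambda / n.a.

    Default values are those given in the text: mean wave-length 0.55 micron,
    numerical aperture 0.11, and visual-acuity factor k of about 0.5.
    """
    return k * 1.22 * wavelength_um / numerical_aperture


diameter_um = apparent_disk_diameter_um()

if __name__ == "__main__":
    print(f"apparent diffraction-disk diameter: {diameter_um:.2f} micron")
```

This gives a diameter of about 3 μ, to be compared with the observed intercepts of between 3 and 4 μ.
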
From the slope of the graph, the value of the ratio (particle amplitude)/(voltage)
was obtained. Using this ratio, the velocity at the centre of the tube for any intensity
of sound of this frequency could be found in terms of the voltage from the microphone.
The tube and associated apparatus were then rearranged and the ratio
torque/(voltage)² determined for the Rayleigh disk A. The torque was suitably
corrected for the thickness and lack of infinite inertia of the disk, for diffraction and
for proximity of the disk to the wall of the tube. The value of $L_1$ in the formula

$$L = L_1 a^3 v^2 \sin 2\theta$$

was then calculated by combining these data with the measured values of the radius
of the disk and of the angle between the disk and the stream. The values of $L_1$ were
found at each of the frequencies: 250, 750, 1250, 1750, 2250, 3250 and 4250 cyc./sec.
According to König's formula, the value of $L_1$ should be $\tfrac{4}{3}\rho$. The ratio of $L_1$, as
determined by experiment, to $\tfrac{4}{3}\rho$ is plotted against frequency in figure 8. In the



Figure 8. Relation between torque calculated from König's formula
and torque determined by measurement, plotted against frequency (c/s).

○ Merrington and Oatley. + Present author.

Dotted line gives axis of reference on the assumption that the particle is a sphere of radius
0.3 μ and that formula (9) describes the departure of the smoke particle amplitude from the
air particle amplitude.



An investigation of the performance of the Rayleigh disk

experimental determination of $L_1$ the smoke particles have been regarded as following
exactly the motion of the air. If we assume that formula (9) gives the
relation between the lengths of the traces of the smoke and the air particles and that
the particles have an effective radius of 0.3 μ, the values of $L_1/\tfrac{4}{3}\rho$ in figure 8 must be
reduced in accordance with the results of table 3. The amount of the correction to
the ratio is shown by the dotted curve in figure 8 and, with the above assumptions,
this dotted curve should be regarded as the axis to which the ratio $L_1/\tfrac{4}{3}\rho$ should be
referred. For purposes of comparison, the values obtained for the ratio of $L_1$ to $\tfrac{4}{3}\rho$
at much lower frequencies (9 to 22 cyc./sec.) by Merrington & Oatley have been
corrected for the effect of finite thickness of the disk by use of figure 4 of this paper,
and the mean value is plotted as a circled point in figure 8. Application of the
correction curve of figure 4 improves, on the whole, the consistency between the
individual results of the above authors.

In the region of 750 cyc./sec. a second comparison was made between $L_1$ and $\tfrac{4}{3}\rho$
using an electromagnetically driven diaphragm as a driving unit for the tube. The
ratio agreed with that previously obtained to within 1 %.

5. General discussion 

In the paper the form of the expression for the torque on a Rayleigh disk of
cylindrical shape in a sound field has been investigated over a wide range of intensity
and frequency. During the investigation, the dependence of the torque on the
thickness of the disk and the modification to the torque which results from enclosing
the disk in a smooth circular tube have been found. The experiments show that as
the disks become very thin and substantially immobile, the expression for the torque
tends to a form similar to that put forward on theoretical grounds by König. As
the disk becomes thicker the torque tends to increase: for a cylindrical disk of thickness
0.02 of its diameter the torque is increased by 3 %. The thickness therefore has
a marked effect on the torque, of a sign which is different from that deduced by
König for disks of ellipsoidal form. This conclusion is in agreement with the results
of Zernov, who worked with sound of much lower frequency. The effect of the wall 
of a cylindrical enclosing tube on the torque is shown to be small provided that 
the diameter of the tube is more than twice that of the disk. Tomotika has obtained 
theoretically a similar result for the two-dimensional analogue of the Rayleigh 
disk, using assumptions similar to those of König. The correction to the torque for
the tendency of the disk to follow the oscillatory motion of the sound has been
determined independently both by Wood and by King and has been applied to the
results of the present work. King's correction (a diffraction correction) for the
effect on the torque of the compressibility of the medium has been applied to 
the results, but it is likely that the correction is small since it was possible to use 
disks of diameter small compared with the wave-length. Experimental confirmation 
of the size of this correction is difficult to obtain. Large, and therefore easily 
measurable, effects are obtained only when the diameter of the disk is of the same 



order as the wave-length of the sound. The diameter of the enclosing tube must
therefore be of the same order as the wave-length and in this case it is difficult to
produce longitudinal waves in the air column without introducing transverse waves
which distort the field and provide a spurious torque.*

The assumptions of streamline flow, and the absence of discontinuous flow, are
made in König's theory of the Rayleigh disk, and the failure of these assumptions
would invalidate König's expression for the torque. It must be borne in mind that
the agreement in general form between the experimental results and those predicted
by König's expression is not in itself evidence of the applicability of König's theory.
If we commence with the premise that the torque must depend upon a simple product
of some power each of the density of the air, the radius of the disk and the velocity
of the air, it can be shown by the method of dimensions that the torque varies
directly as the density, directly as the cube of the radius and directly as the square
of the particle velocity. It is therefore probable that even if turbulence were present
in the sound field near the disk, an expression of the form

$$L = L_1 a^3 v^2 \sin 2\theta$$

would continue to describe the torque on the disk. Justification of the validity of
König's analysis is best afforded by comparison of the experimental value of $L_1$
with the König value $\tfrac{4}{3}\rho$ of this factor. The comparison has been made and is
described in the second part of this paper.
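The dimensional argument can be followed through explicitly. The sketch below assumes, as in the text, that the torque is a product of powers of density, radius and velocity, and solves for the exponents by balancing mass, length and time. The small exact solver is written for this illustration only and is not part of the original work.

```python
from fractions import Fraction


def solve_linear(a, b):
    """Solve a square linear system exactly by Gauss-Jordan elimination."""
    n = len(b)
    m = [[Fraction(x) for x in row] + [Fraction(rhs)] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = next(r for r in range(col, n) if m[r][col] != 0)
        m[col], m[pivot] = m[pivot], m[col]
        m[col] = [x / m[col][col] for x in m[col]]
        for r in range(n):
            if r != col and m[r][col] != 0:
                factor = m[r][col]
                m[r] = [x - factor * y for x, y in zip(m[r], m[col])]
    return [row[-1] for row in m]


# Rows: mass, length, time. Columns: density (M L^-3), radius (L), velocity (L T^-1).
dimension_matrix = [
    [1, 0, 0],
    [-3, 1, 1],
    [0, 0, -1],
]
torque_dimensions = [1, 2, -2]  # torque has dimensions M L^2 T^-2

exponents = solve_linear(dimension_matrix, torque_dimensions)
print("density, radius, velocity exponents:", [int(e) for e in exponents])
```

The exponents obtained are 1, 3 and 2, i.e. the torque varies as density × (radius)³ × (velocity)², leaving only the numerical factor and the angular dependence undetermined.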

The comparison of $L_1$ with $\tfrac{4}{3}\rho$ requires the independent determination of particle
velocity, and this has been made by means of measurement of the amplitude of smoke
particles in the sound field. The comparisons show that the torque exerted on the
disk is larger than is given by König's formula by an amount between 0 and 7 %.
The magnitude of this discrepancy varies with frequency and is least in the neighbourhood
of 1200 cyc./sec. At 250 cyc./sec. the discrepancy is about 7 % of the torque,
and this agrees within 2 % with the observations of Merrington & Oatley when
working under very different experimental conditions at frequencies between 9 and
22 cyc./sec. With regard to the cause of this lack of agreement between the experimental
and the König values for the torque, there is little which can be added save
in confirmation of the observations of Merrington & Oatley. Investigation of the
motion of smoke-laden air in the sound field in the neighbourhood of the Rayleigh
disk confirms the existence, over the whole range of frequency and intensity covered
by the present work, of Andrade vortices. There is strong evidence of the continued
existence of these circulations for the lowest velocities at which it is possible to

* The corrections which have been discussed above can in practice be made sufficiently
small (say, less than 2 % of the torque) not to introduce serious errors in the application of
König's expression. Thus for a glass disk 0.63 cm. diameter and 0.007 cm. thick, the correction
for diffraction (in air) is 1 % of the torque for 3800 cyc./sec. and the correction due to proximity
of the wall of an enclosing tube 5 cm. diameter is −0.6 %. The correction for the thickness
of the disk is −1.8 % and that for the mobility of the disk is +1.8 % of the torque.
The last two of these corrections can each be reduced by using a slightly thinner disk made
from a dense material such as, for example, polished stainless steel.




estimate movement of the particles. No abrupt transition to vortex-free conditions
could be observed. The apparent absence of transition is supported by the
linearity of the relation between torque on the disk and the square of voltage from 
the microphone. The slight departure from linearity which is shown in figure 2 for 
4250 cyc./sec. does not suggest that the torque on the disk, even at this frequency, 
tends to be consistently lower for very small velocities. 

At 750 cyc./sec. an attempt was made to estimate the magnitude of the torque on 
the disk due to the steady circulations. The circulations were made visible by blowing 
magnesium oxide smoke into the air near the disk, and the shapes of the principal 
sections of the pattern were photographed using flat-beam illumination. Measurements
were made of the velocity, for a given intensity level of the sound, at various
points in the field and the magnitude of the streaming velocity close to the disk was
estimated from the data. From the velocity an assessment was made in the following
manner of the torque which should accompany the circulations. The maximum
value of the defect of pressure near the surface of the disk was calculated by Bernoulli's
law from the maximum velocity near the disk, and this defect of pressure
was multiplied by one-third of the area of the disk to set an upper limit to the force
on each face of the disk. The force was then multiplied by the diameter of the disk.
These operations might be expected to give an overestimate of the magnitude of
the torque due to the vortices. Investigation showed that this torque, although of
correct sign, appears to be rather less than 1 % of the total torque and therefore
insufficient to account for the departure from König's formula.
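The estimate described above can be set out as a short calculation. The sketch below follows the three stated steps (Bernoulli pressure defect, one-third of the disk area, multiplication by the diameter). The numerical inputs in the example call are hypothetical placeholders, since the measured streaming velocity is not quoted in the text.

```python
import math


def streaming_torque_upper_bound(density, max_velocity, disk_diameter):
    """Upper limit to the torque from steady circulations, following the text.

    pressure defect = (1/2) * density * velocity^2   (Bernoulli's law)
    force per face  = pressure defect * (disk area / 3)
    torque          = force * disk diameter
    """
    pressure_defect = 0.5 * density * max_velocity ** 2
    disk_area = math.pi * (disk_diameter / 2.0) ** 2
    force = pressure_defect * disk_area / 3.0
    return force * disk_diameter


# Hypothetical inputs for illustration only (c.g.s. units): air density,
# an assumed streaming velocity, and a disk of diameter 0.63 cm.
estimate = streaming_torque_upper_bound(
    density=1.2e-3, max_velocity=1.0, disk_diameter=0.63
)
print(f"upper limit to streaming torque: {estimate:.3e} dyne cm")
```

Because the maximum pressure defect is applied over a third of the face and the full diameter is used as the lever arm, the figure is an overestimate, as the text notes.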

The discrepancies of figure 8 are too large to be accounted for in terms of experimental
uncertainty. Such residual errors as may be associated with the uncertainty
in the experimental measurements described in this paper are unlikely to be much
larger than 1 % in particle velocity, i.e. 2 % in torque. The measurements of torque,
radius of disk, density, frequency and voltage were made with a precision sufficient
to determine $L_1$ to within 0.2 %. The largest uncertainties are due to three possible
causes. Harmonic distortion of the acoustical waveform might have introduced
errors in $L_1$ of up to 0.5 % during measurements of torque or of particle amplitude.
In the higher frequency region, i.e. above 3000 cyc./sec., the observed non-uniformity
of the sound field might have introduced errors of 1 % in the determination of $L_1$.
Measurements of smoke particle amplitude could be made to about 1 %,
corresponding to an uncertainty of about 2 % in $L_1$.
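The doubling of the percentage error follows from the appearance of the velocity squared in the torque formula. The sketch below assumes the form L = L₁ a³ v² sin 2θ used in this paper, solves it for L₁, and shows the effect of a 1 % error in velocity. All inputs are set to unity because only the ratio matters.

```python
def l1_factor(torque, radius, velocity, sin_2theta):
    """Solve L = L1 * a^3 * v^2 * sin(2 theta) for the factor L1."""
    return torque / (radius ** 3 * velocity ** 2 * sin_2theta)


reference = l1_factor(1.0, 1.0, 1.0, 1.0)
with_velocity_error = l1_factor(1.0, 1.0, 1.01, 1.0)  # velocity 1 % too large

relative_change = abs(with_velocity_error - reference) / reference
print(f"1 % in velocity gives {100 * relative_change:.2f} % in L1")
```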

The investigation confirms the stability of performance of the Rayleigh disk and
justifies its continued use as a standard of reference of acoustical intensity. Calculations
of the particle velocity of the medium in terms of the torque on the disk
must take account of the failure of the numerical factor $\tfrac{4}{3}$ in König's formula, and
suitable corrections amounting to from 0 to 3.5 % of the particle velocity must be
applied according to the results of figure 8. For the large bulk of measurements
made in acoustics with microphones, an accuracy of 1 db. in intensity is adequate;
in such work the corrections enumerated above, which amount to no more than
0.3 db., are barely significant.
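The figure of 0.3 db. can be recovered from the percentages quoted. The sketch below converts a 3.5 % correction in particle velocity, and equivalently a 7 % correction in torque, to decibels using the usual definitions (20 log₁₀ for a velocity ratio, 10 log₁₀ for a quantity varying as the square of velocity). The function names are illustrative.

```python
import math


def velocity_ratio_to_db(ratio):
    """Level difference in db for a ratio of particle velocities."""
    return 20.0 * math.log10(ratio)


def torque_ratio_to_db(ratio):
    """Level difference in db for a ratio of torques (torque varies as velocity squared)."""
    return 10.0 * math.log10(ratio)


velocity_correction_db = velocity_ratio_to_db(1.035)  # 3.5 % in particle velocity
torque_correction_db = torque_ratio_to_db(1.07)       # 7 % in torque
print(f"{velocity_correction_db:.2f} db, {torque_correction_db:.2f} db")
```

Both conversions give very nearly 0.3 db., well inside the 1 db. accuracy regarded as adequate for most microphone work.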





The author is indebted to Mr B. G. Churcher and to Mr A. J. King for authorization
of the work in this paper and for their continued interest and encouragement
during its progress. The author wishes to thank the staff of the British Cotton Industry
Research Association for their aid in determining with their electron
microscope the size of the smoke particles. The author expresses his thanks to
Dr A. P. M. Fleming, C.B.E., Director of Metropolitan-Vickers Electrical Co. Ltd.,
for permission to publish this paper.

References 

Andrade 1931 Proc. Roy. Soc. A, 134, 445.

Andrade & Parker 1937 Proc. Roy. Soc. A, 159, 507.

Barnes & West 1927 J. Instn Elect. Engrs, 65, 871.

Carriere 1929 J. Phys. Radium, 10, 198.

Churcher & King 1934 J. Instn Elect. Engrs, 75, 411.

King 1935 Proc. Roy. Soc. A, 153, 17.

König 1891 Ann. Phys., Lpz. 43, 43.

Merrington & Oatley 1939 Proc. Roy. Soc. A, 171, 505.

Rayleigh 1880 Proc. Camb. Phil. Soc. 4, 18.

Rayleigh 1882 Phil. Mag. 14, 186.

Tomotika 1937 Proc. Phys. Math. Soc. Japan, 19, 328.

Wood 1935 Proc. Phys. Soc. 47, 779.

Zernov 1908 Ann. Phys., Lpz. 26, 79.


A note on the Hamiltonian theory of quantization 
By T. S. Chang, Ph.D., Central University, Chungking
(Communicated by P. A. M. Dirac, F.R.S.—Received 11 July 1944) 


In this note a study is made of the field equations which are obtained from varying a Lagrangian
subject to auxiliary conditions. It is shown that with proper choice of conjugate variables
and the Hamiltonian, they can be brought to canonical forms and thus permit quantization
in the usual way. It is then pointed out that for fields with some conjugate variables
missing, it is sometimes possible to introduce such auxiliary conditions without affecting the
field equations but with the result that all the new canonical variables are present, thus
allowing the application of the standard methods of quantization. To illustrate this, the
Maxwell field is quantized subject to $\nabla\cdot\mathbf{A} + c^{-1}\,\partial\phi/\partial t = 0$. The usual supplementary condition
on $\psi$, $(\nabla\cdot\mathbf{A} + c^{-1}\,\partial\phi/\partial t)\,\psi = 0$, is found to be shifted to other conditions on $\psi$. Though
in the beginning there are apparently some complications, the final result is simple.

The Hamiltonian theory of a continuum and its quantization was first given by
Heisenberg & Pauli (1929) and was later developed in another form by Weiss (1936,
1938a, 1938b). The latter form has the advantage that it is throughout relativistically
invariant, and thus no explicit proof of the relativistic invariance of the equations
obtained at various stages is necessary.

According to these authors, the field equations are the Euler equations obtained
by taking the minimum of the integral of a certain invariant function $L$ (known as the
Lagrangian) over the space formed by the independent variables. In this note we shall
take the minimum after imposing auxiliary conditions on the dependent variables
and see the alterations which must be made on the canonical equations and the
subsequent quantization. In view of the possibility that a general field theory may
contain such auxiliary conditions, such a study is perhaps not entirely meaningless.

Sections 1, 2 and 3 give the general theory of quantization for fields with such
auxiliary conditions. The method and the notations employed by Weiss will be
followed. It will be seen that the modifications due to the presence of auxiliary
conditions are few. To apply the theory, we shall quantize the Maxwell field after
imposing on it the condition

$$\nabla\cdot\mathbf{A} + \frac{1}{c}\frac{\partial\phi}{\partial t} = 0.$$

Though in the beginning there are more complications than in the usual theories,
the final result is simple.


1. Euler’s equations and the complete variation 

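Let $z^\alpha$ be the dependent variables ($\alpha = 1, 2, \ldots, \nu$), $x^i$ be the independent variables
($i = 1, 2, \ldots, n$), and $z^\alpha_i$ be the derivatives $\mathrm{d}z^\alpha/\mathrm{d}x^i$.† Take the minimum of

$$I = \int_G L(x^i, z^\alpha, z^\alpha_i)\,dx \qquad (1)$$

subject to the $f$ conditions

$$f_\beta(x^i, z^\alpha, z^\alpha_i) = 0 \quad (\beta = 1, 2, \ldots, f). \qquad (2)$$

Using Lagrange's method of unknown multipliers, it is found that

$$\frac{\partial L}{\partial z^\alpha} + \mu_\beta\frac{\partial f_\beta}{\partial z^\alpha} - \frac{\mathrm{d}}{\mathrm{d}x^i}\left(\frac{\partial L}{\partial z^\alpha_i} + \mu_\beta\frac{\partial f_\beta}{\partial z^\alpha_i}\right) = 0, \qquad (3)$$

which together with (2) gives $z^\alpha$ as functions of $x^i$ corresponding to a minimum of $I$
when the values of $x$ and $z$ on the boundary $S$ of the region $G$ are kept fixed. (2) and
(3) are the field equations.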
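As a small illustration of how an undetermined multiplier enters Euler equations of this kind (a sketch under assumptions chosen here, not an example taken from Weiss or from the present note), take one independent variable $t$, two dependent variables $z^1$, $z^2$, a hypothetical Lagrangian quadratic in the derivatives, and a single auxiliary condition confining the motion to the unit circle. The Euler equations with multiplier $\mu$ then read $\partial L/\partial z^\alpha + \mu\,\partial f/\partial z^\alpha - (d/dt)(\partial L/\partial \dot z^\alpha + \mu\,\partial f/\partial \dot z^\alpha) = 0$, and the multiplier is fixed by the condition itself:

```latex
% Assumed toy problem: n = 1 (independent variable t), \nu = 2, one condition.
\begin{gather*}
L = \tfrac12\left[(\dot z^1)^2 + (\dot z^2)^2\right], \qquad
f = (z^1)^2 + (z^2)^2 - 1 = 0, \\
\frac{\partial L}{\partial z^\alpha} + \mu\,\frac{\partial f}{\partial z^\alpha}
  - \frac{d}{dt}\left(\frac{\partial L}{\partial \dot z^\alpha}
  + \mu\,\frac{\partial f}{\partial \dot z^\alpha}\right)
  = 2\mu z^\alpha - \ddot z^\alpha = 0, \\
\frac{d^2 f}{dt^2} = 0
  \;\Longrightarrow\; \dot z^\alpha \dot z^\alpha + z^\alpha \ddot z^\alpha = 0
  \;\Longrightarrow\; \mu = -\tfrac12\,\dot z^\alpha \dot z^\alpha .
\end{gather*}
```

Here the multiplier is determined only after the condition has been used together with the Euler equations, which is the situation the general theory has to accommodate.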

Now find the complete variation of I. Using Lagrange’s method of unknown 
multipliers, then 





( 4 ) 


† Differentiation with respect to one of the $n$ variables $x^i$ is denoted by $\mathrm{d}/\mathrm{d}x^i$, and differentiation
with respect to one of the $n + \nu + n\nu$ variables $x^i$, $z^\alpha$, $z^\alpha_i$ by $\partial/\partial z^\alpha$, $\partial/\partial x^i$, $\partial/\partial z^\alpha_i$.

‡ The two $\delta z^\alpha$ in (4) have different meanings. No attempt will be made to explain them here,
the reader being referred to Weiss's paper (1936).



where $u^r$ ($r = 1, 2, \ldots, n - 1$) are the parameters for the surface $S$ and $\eta_\beta$ are the
multipliers, which may yet be arbitrary functions of $x^i$,

< 6 > 


x t = mm-z\ 


N t being defined by 


bjP 

a!N, = 

bx 1 
bti n ~ 1 


bx n 

bu n ~ l 


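Taking $\delta I$ as due to variations of $z^\alpha$ from a solution of the field equations (2) and (3),
we may let the $\eta_\beta$ be the $\mu_\beta$ in (3), and then

$$\delta I = \int_S (P^*_\alpha\,\delta z^\alpha + X^*_i\,\delta x^i)\,du, \qquad (8)$$

where $P^*_\alpha$ and $X^*_i$ are given by the right-hand sides of (5) and (6) with $\eta_\beta$ replaced
by $\mu_\beta$.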

The conservation equations are modified. Letting 




so that X't - N, UL then 


bUl 3 L 


= B+Wih- 


dx* 


2. Hamiltonian formulation

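Introduce a co-ordinate $u^0$ so that

$$N_i\,(\mathrm{d}x^i/\mathrm{d}u^0) = 1. \qquad (11)$$

This co-ordinate is denoted by $w$ in Weiss's papers, and is, for symmetry in notation,
changed to $u^0$ in the present note. It is seen at once that

$$N_i\,(\partial L/\partial z^\alpha_i) = \partial L/\partial z^\alpha_0 \qquad (12)$$

and

$$X_i\,\delta x^i = (L - z^\alpha_0 P_\alpha)\,\delta u^0 - z^\alpha_r P_\alpha\,\delta u^r,\dagger \qquad (13)$$

where $z^\alpha_r$ denotes $\mathrm{d}z^\alpha/\mathrm{d}u^r$ and $z^\alpha_0$ denotes $\mathrm{d}z^\alpha/\mathrm{d}u^0$. For shortness, the function
$L - z^\alpha_0 P_\alpha$ is denoted by $H'$.

It is necessary to note that $P_\alpha$ is defined by (5) with quantities $\eta_\beta$ whose nature
has not been specified. Thus $H'$ is a function of $x^i$ (or $u^r$, $u^0$), $z^\alpha$, $z^\alpha_0$ and $\eta_\beta$. With the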

† $r$ runs always from 1 to $n - 1$.



help of (2) and (5) it may be transformed into a function of $x^i$, $z^\alpha$, $z^\alpha_r$ and $P_\alpha$, say $H$.
Evidently dH must be dH' plus a linear combination of dp and t 



the linear combination being so chosen that its sum with $dH'$ does not contain
$dz^\alpha_0$ and $d\eta_\beta$. Noting that


can be reduced down to 




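and thus does not contain $dz^\alpha_0$ and $d\eta_\beta$, it may be concluded that it is $dH$. Needless
to say, the $\eta_\beta$ in (14) are functions of $x^i$, $z^\alpha$, $z^\alpha_r$ and $P_\alpha$ determined by (2) and (5).

From (14), it follows that

$$\frac{\partial H}{\partial P_\alpha} + \frac{\mathrm{d}z^\alpha}{\mathrm{d}u^0} = 0. \qquad (15)$$

It is easy to see now that the equation

$$\frac{\partial H}{\partial z^\alpha} - \frac{\mathrm{d}}{\mathrm{d}u^r}\frac{\partial H}{\partial z^\alpha_r} = \frac{\mathrm{d}P_\alpha}{\mathrm{d}u^0} \qquad (16)$$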


is precisely (3) with $\eta_\beta$ in place of $\mu_\beta$. Hence the set of equations (2), (5), (16) is
completely equivalent to the set of equations (2), (3). Equation (15) follows from
(2) and (5) and the definition of the function $H$. Hence we have the set of equations
(2), (5), (15) and (16) to solve for $z^\alpha$. But if (15) and (16) are independent, which
they must be in a general case (since there is no reason to the contrary), they are
already sufficient. Hence (2) and (5) must be redundant equations, i.e. they are
deducible from (15) and (16). A rigorous proof of this statement will not be attempted
here.

The foregoing says that the set of equations (15), (16) is equivalent to the set
(2), (3). It says also that if $P_\alpha$ and $z^\alpha$ satisfy (15), (16), the $\eta_\beta$ are simply the $\mu_\beta$'s in (3),
and no distinction between the present $P_\alpha$ and the $P^*_\alpha$ in § 1 is necessary. From
the appearance of the equations (15), (16), $H$ may be called the Hamiltonian, (15),
(16) the canonical equations, and $P_\alpha$, $z^\alpha$ the conjugate variables.

Conversely, given an $H$ and the equations (15), (16), of which the solution is
known to satisfy (2), it is possible to find a Lagrangian whose minimum subject to
(2) gives the given field equations. In fact, a possible Lagrangian is obtained from
$H + z^\alpha_0 P_\alpha$ by eliminating $P_\alpha$ with the help of (15).

The conservation equation is, from (14), (15), (16), 



™- d Jt +E7 , d JL 

3 u? du* u 0 ’ 


( 17 ) 


$\eta_\beta$ being functions of $x^i$, $z^\alpha$, $z^\alpha_r$, $P_\alpha$ obtained by solving for them from (2) and (5).





3. Passage to quantum theory 


Let us see if the theory of relative integral invariants as given by Weiss (1938a)
can be taken over here. According to him, the space of the independent variables in
consideration should be of a cylindrical type, by which is meant that if the independent
variables are $x$, $y$, $z$ and the time $t$, the space is bounded by two space-like
surfaces $S_1$ and $S_2$ and a time-like surface.† Let us consider the proper specification
of data on $S_1$, which are here called the initial data, and those on the time-like surface,
which are called the boundary data. From the form of equations (15), (16), it is at
once seen that values of $z^\alpha$ and $P_\alpha$ on $S_1$ form a complete set of initial data and those
of $z^\alpha$ on the time-like surface a complete set of boundary data. This may be explained
by the fact that though on $S_1$ values may be assigned to only $2\nu - f$ of the $z^\alpha$ and $z^\alpha_0$'s,
since there are $f$ relations between them, yet the $\mu$'s in (3) are arbitrary, and strictly
speaking their values at $S_1$ form a part of the initial data. This allows arbitrary values
to be given to $P^*_\alpha$ and $z^\alpha$ on $S_1$, or $P_\alpha$ and $z^\alpha$ on $S_1$ (since $P^*_\alpha$ and $P_\alpha$ are identical so long
as $P_\alpha$, $z^\alpha$ satisfy (15), (16)), and thus to treat $P_\alpha$, $z^\alpha$ as a complete set of initial data.

With the initial data $P^*_\alpha$, $z^\alpha$, the boundary data $z^\alpha$ and the formula (8), it is not
difficult to see that all brackets $[P^*_\alpha(u), P^*_\beta(u')]$, $[P^*_\alpha(u), z^\beta(u')]$, $[z^\alpha(u), z^\beta(u')]$,‡ as
defined by Weiss (1938a), are integral invariants. Thus

$$[P_\alpha, P_\beta] = [z^\alpha, z^\beta] = 0, \quad [P_\alpha(u), z^\beta(u')] = \delta^\beta_\alpha\,\delta(u - u'), \qquad (18)$$

and thus ’ ^{u),^-P a {u')z*{u')du'~\^ = (19) 

As usual, the canonical equations give 

< 20 > 

In quantum mechanics we define 

$$[a, b]_- = ab - ba, \quad [a, b]_+ = ab + ba. \qquad (21)$$

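Assuming the law

$$\frac{\mathrm{d}f(u)}{\mathrm{d}u^0} = \epsilon\left[f(u), \int H(u')\,du'\right]_- \qquad (22)$$

where $\epsilon$ is a constant depending on the choice of $u^0$, $[z, P]_-$ and $[z, P]_+$, etc., do not
change with $u^0$ if they are initially c-numbers. (i) For the case in which $[z, z]_-$, $[z, P]_-$, etc.,
are c-numbers, the particular choice

$$[P_\alpha, P_\beta]_- = [z^\alpha, z^\beta]_- = 0, \quad \epsilon[P_\alpha(u), z^\beta(u')]_- = \delta^\beta_\alpha\,\delta(u - u'), \qquad (18\cdot1)$$

etc., gives field equations identical with the classical ones. Equation (19) is replaced by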


e /(“)> J- P a (u') z?(u') du’ 


+ 


df(u) _ bf(u) 
8u r bu T ’ 


(19-1) 


† A space-like surface (or a time-like surface) is one in which the vector joining a point in the
surface to another point in its immediate neighbourhood also lying in the surface is necessarily
a space-like (or may be a time-like) vector.

‡ $u$, $u'$ stand for $u^1, \ldots, u^{n-1}$ and $u'^1, u'^2, \ldots, u'^{n-1}$.



which completes with (22) to form a relativistically invariant equation. (ii) For the
case in which $[z, z]_+$, $[z, P]_+$, etc., are c-numbers, the particular choice

$$[P_\alpha, P_\beta]_+ = [z^\alpha, z^\beta]_+ = 0, \quad \epsilon[P_\alpha(u), z^\beta(u')]_+ = l\,\delta^\beta_\alpha\,\delta(u - u'), \qquad (18\cdot2)$$

etc., with $l$ as an arbitrary real or complex number gives also classical field equations,
provided that $H = \sum \ldots z'P$, and on passing to quantum mechanics, we let $H$ be

$$H = (2l)^{-1} \sum \ldots [(l + 1)\,z'P + (l - 1)\,Pz']. \qquad (23)$$

So far, it is difficult to find other simple forms of H , so that (18-2) and (22) give the 
classical field equations. The equation (19) is now changed to 

’ (I) [/(«)>- + + = (19-2) 

Thus if we have classically $H = \sum \ldots z'P$, it may seem that there are many Fermi-Dirac
schemes of quantization, corresponding to different values of $l$. But they are actually
equivalent, for through a proper transformation of the $z$ and $P$, one scheme may
change into another.
The question which arises if one requires $z$, $P$ to be permanently real (or imaginary)
can almost always be solved if $L$ is chosen properly. For (i) let us restrict ourselves for
definiteness to the case in which there are no auxiliary conditions, the Lagrangian $L$
is symbolically quadratic in $z$ and none of the $P$'s are missing. Let $u^0$, $u^r$ be all real
and let us assume $\epsilon$ to be purely imaginary. Now suppose we choose $L$ so that if $z^\alpha$, $z^\alpha_i$
are real c-numbers, so is $L$. On letting $z$ and $P$ be all real dynamical variables (or
all imaginary) at one $u^0$, which is consistent with (18·1), it is found that $H$ at this
$u^0$ is real, and thus $z$ and $P$ remain real (or imaginary). Case (ii) with its special
form of the Hamiltonian can be realized only in cases in which $L$ is symbolically
linear in the dynamical variables $z^\alpha$ and also in the variables $y^\beta$ ($\alpha, \beta = 1, 2, \ldots, \nu$)
and the conjugate variables of $y$ do not exist, or in equivalent cases. Assuming that
all the conjugate variables of $z^\alpha$, say $P(z^\alpha)$, exist and are independent, $H(z^\alpha, z^\alpha_r, P(z^\alpha))$
may be introduced as

$$L - z^\alpha_0 P(z^\alpha),$$

whereupon the Euler equations for $z^\alpha$ and $y^\beta$ become precisely the canonical equations
with the Hamiltonian $H$ and the conjugate variables $z^\alpha$, $P(z^\alpha)$. With real $u^0$, $u^r$
and imaginary $\epsilon$ as before and with $l$ chosen to be 1 for definiteness, we choose $L$
so that if $z^\alpha$ and $P(z^\alpha)$ are real c-numbers, so is $H$. Then on letting $z$ be real and $P$
be imaginary (or vice versa) at one $u^0$, which is consistent with (18·2), we find that
$H$ is real at $u^0$ and thus the reality of $z$, etc., remains.

This section will be concluded with a word on the passage from operator equations
of the type

$$\frac{\mathrm{d}}{\mathrm{d}u^0} f(z, P, u^0, u^r) = \frac{\partial f}{\partial u^0} + \epsilon\left[f, \int H(u')\,du'\right]_- \qquad (24)$$

to a Schrödinger equation, concerning which no clarification, though mostly a
repetition of well-known facts, can be considered as superfluous. For simplicity,
assume $n = 1$, $u^0 = t$, $\nu = 1$, i.e. there is only one $z$ and one $P$, both being functions
of $t$ only. It is necessary to note that $H$ in (24) is a dynamical variable at time $t$.
Hence in order to get (24), it is necessary to assume for the wave function $\psi$ at time
$t$, say $\psi_t$, the equation

( 25 ) 

where, in order not to forget that $z$, $P$ are functions of $t$, a suffix $t$ is attached to them.
Let us choose the fixed $\psi$ in the Heisenberg picture to coincide with the moving $\psi$ in
the Schrödinger picture at $t = 0$. Thus the fixed $z$, $P$ in the Schrödinger picture, say
$z_s$, $P_s$, coincide with the moving $z$, $P$ in the Heisenberg picture at $t = 0$, say $z_0$, $P_0$.
Thus the correct procedure to get a Schrödinger equation is to solve $z_t$, $P_t$ in terms of
$z_0$, $P_0$, $t$, to substitute into (25), to replace $z_0$, $P_0$ by $z_s$, $P_s$, and to insert $\sum \psi_{z'_s}(z'_s \mid t)$
for $\psi_t$. In case $H$ does not contain $t$ explicitly, one may use the fact that

$$H(z_t, P_t) = H(z_0, P_0),$$

and the task of solving for $z_t$, $P_t$ in terms of $z_0$, $P_0$, $t$ and substituting into $H(z_t, P_t)$ can
be avoided.
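The fact that $H(z_t, P_t) = H(z_0, P_0)$ when $H$ has no explicit time dependence can be seen in the simplest classical case. The sketch below assumes a harmonic oscillator with unit mass and frequency, treated with ordinary numbers rather than operators. This example is chosen here for illustration and does not appear in the paper.

```python
import math


def hamiltonian(z, p):
    """Assumed example Hamiltonian H = (z^2 + P^2) / 2, with no explicit t."""
    return 0.5 * (z * z + p * p)


def evolve(z0, p0, t):
    """Solve z_t, P_t in terms of z_0, P_0 and t for the oscillator above."""
    z_t = z0 * math.cos(t) + p0 * math.sin(t)
    p_t = p0 * math.cos(t) - z0 * math.sin(t)
    return z_t, p_t


z0, p0 = 0.3, -1.1  # arbitrary initial values
for t in (0.0, 0.7, 2.5):
    z_t, p_t = evolve(z0, p0, t)
    print(t, hamiltonian(z_t, p_t) - hamiltonian(z0, p0))
```

The differences printed are zero to rounding error, so that in this case $H(z_t, P_t)$ may be replaced by $H(z_0, P_0)$ without first solving the equations of motion.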

Two notes must be added. First, from the appearance of (25), one might be tempted
to put $\psi_t$ as $\sum \psi_{z'_t}(z'_t \mid t)$,

where $\psi_{z'_t}$ is the eigenstate for $z_t$ with the eigenvalues $z'_t$. Using the convention for
employing the primes so that $z'_t$ takes the same numerical value for all $t$, it is possible
to find $\mathrm{d}\psi_{z'_t}/\mathrm{d}t$ from the law of change of $z_t$, and with that it is possible to get from
(25) an equation giving $\mathrm{d}(z'_t \mid t)/\mathrm{d}t$ in terms of $(z'_t \mid t)$. It is true that this equation is
usually simple, but we have the disadvantage that $\sum (t \mid z'_t)\,z'_t\,(z'_t \mid t)$ is not the expected
value of $z$ at time $t$. To get the desired value $\phi_0 z_t \psi_0$ or $\phi_t z_0 \psi_t$ with the help of $(z'_t \mid t)$,
it is necessary to know $(z'_0 \mid z'_t)$, which cannot be obtained unless the equation of
motion for $z_t$ is integrated. Secondly, it is easy to note that the procedure of replacing
$H(t, z_t, P_t)$ by $H(t, z_s, P_s)$, as is actually done in current practice, is wrong. Consider
the various derivatives with respect to $t$ of

$$H(t, z_t, P_t) - H(t, z_0, P_0)$$

taken at time $t = 0$. They are zero only when, at $t = 0$,

etc. Since this is in general not so, $H(t, z_t, P_t)$ and $H(t, z_s, P_s)$ are not equal.


4. Application to electrodynamics

The foregoing theory of quantization for fields whose field equations are deduced
from varying a Lagrangian subject to auxiliary conditions can sometimes be applied
with advantage to fields whose field equations are deduced from varying a Lagrangian
under no auxiliary conditions but with some of the $P$'s missing. Let the latter field
equations admit the integrals $f_\beta(x^i, z^\alpha, z^\alpha_i) = 0$. Then, evidently, these field equations
can be deduced from varying the same Lagrangian subject to $f_\beta = 0$. Thus the
methods in §§ 1, 2, 3 may be applied. Generally speaking, it may be found that in
so doing all the new $P$'s are present. If this is so, the quantization may proceed in
a straightforward way.

To illustrate this, let us quantize the Maxwell field using the auxiliary condition†

$$\nabla\cdot\mathbf{A} + \frac{\partial\phi}{\partial t} = 0. \qquad (26)$$

This is different from the procedure adopted by Fermi (1932) and discussed by
Dirac (1935); they assume that

$$\left(\nabla\cdot\mathbf{A} + \frac{\partial\phi}{\partial t}\right)\psi = 0 \qquad (27)$$

holds for $\psi$ representing a state that can actually occur in nature. Here (26) is an
operator equation and thus holds for all $\psi$.
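What the auxiliary condition demands can be seen for a single classical plane wave. The sketch below treats the potentials as ordinary numbers, not operators, takes c = 1 as in the text, and assumes A = a exp i(k·x − ωt), φ = φ₀ exp i(k·x − ωt). The condition then fixes φ₀ = k·a/ω. The function names and numerical values are illustrative only.

```python
def scalar_amplitude(k, a, omega):
    """phi_0 for which a plane wave satisfies div A + d(phi)/dt = 0 (c = 1)."""
    return sum(ki * ai for ki, ai in zip(k, a)) / omega


def condition_residual(k, a, phi0, omega):
    """Coefficient of exp i(k.x - omega t) in div A + d(phi)/dt."""
    divergence_term = 1j * sum(ki * ai for ki, ai in zip(k, a))
    time_derivative_term = -1j * omega * phi0
    return divergence_term + time_derivative_term


k = (1.0, 2.0, 2.0)      # wave vector, so that |k| = 3
omega = 3.0              # frequency, equal to |k| when c = 1
a = (0.5, 0.0, 1.0)      # arbitrary vector-potential amplitude

phi0 = scalar_amplitude(k, a, omega)
print("phi_0 =", phi0, " residual =", abs(condition_residual(k, a, phi0, omega)))
```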

Let $ds^2 = g_{\rho\theta}\,du^\rho\,du^\theta$
($\rho$, $\theta$, … run from 0 to 3, and $r$, $s$, … from 1 to 3), and assume for simplicity that the
$g$'s are constants. Hence, leaving out the Lagrangian which involves the matter only,

We haVe r 1 p# dz p _dz? 3z«\ . e + / 28 ^ 

S7T\du B dv? du° duPj 


where $\mathrm{d}j^\theta/\mathrm{d}u^\theta = 0$.‡ A straightforward application of the methods of the above
sections gives

(29)

(30)

where $P^0_r$, $P^0_0$ are written in place of $P_r$, $P_0$ in order to show their transformation
character. The canonical equations are

dz r dz° _ ] 

du r du° ; • 


and 




. 1 d fdz r 3 z®\ 

^ r 4 e 7 T du 8 \ 3 w s du r ) 




( 31 ) 


† The velocity of light $c$ is taken to be unity for convenience.
‡ In this section $j^\theta$ are treated as c-numbers.



of which the last three, apart from terms containing $P^0_0$, are precisely Maxwell's
equations. As can be seen, the auxiliary equation appears in the canonical equation
for $\mathrm{d}z^0/\mathrm{d}u^0$.

To get down to Maxwell's equations, it is sufficient to assume that for $\psi$ occurring
in nature,

$$P^0_0\,\psi = 0, \quad \left(-j^0 + \frac{\mathrm{d}}{\mathrm{d}u^r}P^0_r\right)\psi = 0 \qquad (32)$$

at $u^0 = 0$. This is of the same nature as the supplementary condition (27) mentioned
in the beginning of this section. It is not difficult to see from (31), which holds for
any $\psi$, that if $\psi$ satisfies (32) at a certain $u^0$, it does so for all subsequent $u^0$. Thus we
have the first two equations of (31) and

._1_ J_/9Zr 

4m 3m*\3m s 



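for all $u^0$. In the following, we shall assume the commutation relations

$$[P, P]_- = [z, z]_- = 0, \quad \epsilon[P^0_\rho(u), z^\theta(u')]_- = \delta^\theta_\rho\,\delta(u - u'), \qquad (33)$$

etc.; hence the operator equations $P^0_0 = 0$ and $-j^0 + (\mathrm{d}/\mathrm{d}u^r)P^0_r = 0$ at $u^0 = 0$ cannot
be assumed on account of their incompatibility with (33).

Let us employ the co-ordinates $u^\rho = u_\rho = (it, x, y, z)$ so that $g_{\rho\theta} = \delta_{\rho\theta}$ and the $\epsilon$ in
equation (24) is equal to $-1/\hbar$. On putting

$$z_\theta = \sum z_{\theta,\mathbf{k}}\,e^{i\mathbf{k}\cdot\mathbf{x}}, \quad P_\theta = \sum P_{\theta,\mathbf{k}}\,e^{i\mathbf{k}\cdot\mathbf{x}}, \quad j_\theta = \sum j_{\theta,\mathbf{k}}\,e^{i\mathbf{k}\cdot\mathbf{x}},$$
$$\mathbf{k} = (k_x, k_y, k_z), \quad \mathbf{x} = (x, y, z),$$

the assumed commutation relations become

$$[P_{\rho,-\mathbf{k}'}, z_{\theta,\mathbf{k}}]_- = -\hbar\,\delta_{\rho\theta}\,\delta_{\mathbf{k}\mathbf{k}'}, \qquad (34)$$

etc. To proceed further, choose the axes for each Fourier wave so that $\mathbf{k}$ becomes
$(k, 0, 0)$, $k$ being $+\sqrt{\sum k_i^2}$. Let us perform a transformation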
*o, k = e«[-*/+_*-/ot_ k (P«+#)] + - *»)],' . 

%,k = «“[/£-* ~ ifZMW - Ji)] + e~ ikl [f^+if- k ($kt + if)], 

^.k = e"/ 2 t_ k +e-®/ 2 - k , 

z 3.k = em fs,-k + , or , 

(3o) 

{4mjk) P 0 , _ k = - if£ k e m - »/£_* e~ M , 

(4 Tr/k) Pi, _ k = / 0 +k e" e~ m , 

{4njk) P 2 , _ k = / 2 +k ~fz- k &~ ikl , 

{4mjk) P z _ k = f£_ k e ikt -f z _ k e~ ikt , 

where $P_{1,-\mathbf{k}}$ denotes the component of $P_{-\mathbf{k}}$ along the direction of $\mathbf{k}$, not that of $-\mathbf{k}$.
The $f$ are therefore dynamical variables involving $t$ explicitly. The relations (35)
are so chosen that for the case $j = 0$, the $f$'s are constants of motion. In terms of $f$,
the commutation relations become

U + ,f + l- = [/-/-]- = 0, t fU>fe^-=- 2 -^S pg S w . (36) 

The reality condition of the field is given by 

flk = "~/ok> fok = ~/ik> fzk —fik’ fsk “ /ik’ (3?) 

with $\bar{v}$ denoting the adjoint of $v$. The integral of $H$ over $x$, $y$, $z$ becomes

■ #1 — —Ejo,- k%,k> 

z .2 . 

<®rad. = +/4/2k +/3k/3k +/ak,/ 3k + ^/ok/ok + Ik f Ik ~~f lk foki> (®^) 


and the initial conditions on $\psi$ become the equations

(/otk-^'o,-k e ~ ifci } ^ = °> {/Ck+^VVk®"} f = °> ( 39 ) 

for $t = 0$. According to the above, if we assume (39) to hold at $t = 0$, they hold
for all $t$.

A further transformation 


ft<)k - ^ ^ + /ik)> a ok - ^ ^ 0k 

ftj.k = ^ 2 (/oik - /ik)i ■ a ik = ^(/ok+/i£)> 

^2k “ (/2k“t/2k)> ®2k *“ — '^U2k~fZk)> 

bsk = ^ (/at"*"/®*)’ tt 3k — _ ^2 (/3k — /»k)> 


gives real $a$ and $b$ satisfying

■^rad. becomes 

,P 


[»,<*]- = [&, ft]- = 0 , [ft pk ,^ k ]_ = -~-S pg S a- 


(40) 


(41) 


E {ft2kft2k + a 2k®2k + ft3kft3k 'b®3 k ®3 k + 2(6 lk a ok — 6 #k U lk ) + f(ftokft<)k'b ftx k fti k )}, (42) 


and the initial condition on $\psi$

(ftok-rok) ^ = (ftik-7ik) f = 0 


for $t = 0$, where


>0k = - (io,-k e~ iK - io > k e<fti ). 


7ik = 0'o.-k +io,k *“) ■ 


( 43 ) 




5. Continuation 

Now pass to the Schrödinger equation. According to the discussion in § 3, it runs as

= t H = H isA +H 1 + H u , (44) 

where $H_M$ is the Hamiltonian for matter. Let $\alpha$ be the dynamical variables for
matter. Let the dynamical variables $\alpha$, $z$, $P$, etc., at $t = 0$ coincide with the respective
fixed variables $\alpha_s$, $z_s$, $P_s$ in the Schrödinger picture, and let $H(\alpha, z, P)$ not contain $t$
explicitly, so that $H(\alpha_t, z_t, P_t)$ may be replaced by $H(\alpha_s, z_s, P_s)$. The equation (44)
becomes

in -(zX 1 1) = Z{z'jx' a | H I z'X) (*;«; 1 1). (45) 

It may be noted that the variables $z_s$, $P_s$ may also be transformed to $f_s$ or $a_s$, $b_s$, the
transformation law being (35), (40) with $t$ put as zero. This we shall do, obtaining
$H(\alpha_s, a_s, b_s)$, which involves no $t$ explicitly. In the following, all notations
concerned with dynamical variables refer to fixed variables in the Schrödinger picture
(unless otherwise mentioned), and the suffix $s$ will be dropped.

The representative (a r , ^..., ..., b ..., b[ k . .. j —) will be used, where 

k 

n 2k = (^2k^2k + ^2k^2k) ~ h 

n Zk — (^3k^3k + a 3k a 3k) ~ h 

( t 7rk\ ^ 

-j-j (n 2k ^e imk + e~ im)c n2)J t ) ) 

( n fi\h 

- j \ Hn ^ e ^- e -^ n ^ i ), 

__ 2nHi d 2ttM 8 

a ' ok “~F % ^"Taft7 k ’ etc * 

($e^{i\omega}$, $e^{-i\omega}$ are well-known matrices.) Substitution of the above into (45) gives a
rather lengthy equation, but it may be simplified as follows.

After assuming (43) at t = 0, these equations follow for all t. Thus 

(^0k If Qk)tf $ — 0? (^lk ~ 7lk)* ^ = 0 (46) 

for all t in a Heisenberg picture, or 

(6 0 k~“ 7ok)i = 0^ = 0j (^lk 7lk)f ~0 (4X) 

† It is hoped that no confusion arises from using $H$ to denote what should be denoted by
$\int H\,dx\,dy\,dz$.




in a Schrödinger picture. On taking a representation in which all $j_{0\mathbf{k}}$ are diagonal,†
we find

[a',n' ...\t) = (a’,n'; !k ...,n' 3k ...\t) 

x n n «*(i4-^uu+&j)4 («> 

Substituting this into the Schrödinger equation, multiplying by

7 rr6®(b' 0 + ...), 

and integrating over all $b'_{0\mathbf{k}}$ and $b'_{1\mathbf{k}}$, all the terms in $H$ containing $a_0$ or $a_1$ drop
out. The simplified equation becomes

3 * 

t*(«', »4 • • • > »4 • • • I«) = S (a' | H I cc") (a",»'... 1 1 ) 

+S*«(»4+*4) («v... I I *)- s ss 

k k /c 2,3 k \ K ‘ ) 

x(a',n'... | TO 2k ie l '“ ,2k j 2ik +e-™ 2k n 2k ij/' 2> _ k | a",ft" ...) (a",ft" ... | <)• (49). 

The third term in the right-hand side is precisely the Coulomb energy times the
representative $(\alpha' n' \mid t)$.

From (48), it is not difficult to see that the probability of having $\alpha$ taking the
values $\alpha'$, $n$ taking $n'$, is proportional to $|(\alpha' n' \mid t)|^2$, and the average value of
$n$ to $\sum n' |(\alpha' n' \mid t)|^2$. If $(\beta' z' P' \mid t)$ is required, where $\beta$ is any other dynamical
variable of matter, we have only to apply the ordinary transformation theory.
All the information required can be derived from $(\beta' z' P' \mid t)$.

The present quantization amounts to shifting the supplementary condition from
(27) to equations (32) for $t = 0$. It has the advantage of having $P_0$ and the commutation
relations (33) which do not place $z_0$, $P_0$ and the other $z$, $P$ on different footings.
It enables one to calculate the complete wave function $(\alpha' n' b' \mid t)$ in a not too
complicated manner. But as can be seen, both from the general character of the theory
and from the equation (49), the usual divergencies in the field theory still remain.
Hence, practically speaking, the improvement brought forth by the present theory,
if any, is small.

The present quantization can be applied to Proca's field. There we see that from
the field equations we may obtain $\partial z^\rho/\partial u^\rho = 0$, and thus we may introduce it in
the very beginning as an auxiliary condition. In the subsequent development, the
departures from the above calculations are of a trivial kind and will therefore be
omitted.

† This can be assumed if $j_0$ at different $\mathbf{x}$ and the same $t$ commute, which holds generally.

J +oo 

[£*(#)] 2 = 1. It is introduced to make 

-00 , . 

J | (a', n f b f | —) | 2 db f finite. Evidently it can be written as {£(#)}* without causing confusion. 

Though in ordinary mathematical analysis the usage of such functions warrants special care, 
they can be used rather freely in quantum mechanics. 



In conclusion, I wish to thank my former teacher, Professor R. H. Fowler, for
his kind encouragement.

References 

Dirac 1935 Quantum mechanics, 2nd ed., last chapter. 

Fermi 1932 Rev. Mod. Phys. 4, 125. 

Heisenberg & Pauli 1929 Z. Phys. 56, 1.

Weiss 1936 Proc. Roy. Soc. A, 156, 192. 

Weiss 1938 a Proc. Roy. Soc. A, 169, 102. 

Weiss 1938b Proc. Roy. Soc. A, 169, 119.


The occurrence and properties of molecular vibrations 

with $V(x) = ax^4$

By R. P. Bell 

(Communicated by C. N. Hinshelwood, F.R.S.—Received 16 November 1943) 

It is shown that in certain modes of vibration of plane rings the potential energy for small
displacements is proportional to the fourth power of the displacement, provided that there is
free rotation about the bonds of the ring. This type of vibration is termed a 'fourth-power
vibration'. It is likely to occur in cyclobutane and its derivatives, in a number of halides
having the formula $X_2Y_6$, and in the hydrides of group III elements.

The energies and wave functions of the first four levels of a one-dimensional oscillator with
$V(x) = ax^4$ have been derived by a method of successive approximations, and asymptotic
formulae are given for the higher levels. The wave functions are qualitatively similar to those
of a harmonic oscillator, but the energy levels differ considerably. A comparison is made
between energy levels for oscillators with $V(x) = a_q|x^q|$ and different values of $q$.

The selection rule for dipole radiation from a fourth-power vibration is discussed.
Overtones will be more numerous than in the spectrum of a harmonic oscillator. Estimates are
made of the spectrum frequencies of fourth-power vibrations in actual molecules, with 
special reference to cyclobutane and diborane. For these two molecules there are observed 
infra-red frequencies of approximately the expected value. The isotope effect should provide 
a means of discriminating experimentally between harmonic and fourth-power vibrations. 

The contribution of a fourth-power vibration to any thermodynamic function will differ
from that of a harmonic vibration with the same fundamental spectrum frequency. Figures
are given for the specific heat, where the difference should be detectable experimentally. In
the general case $V(x) = a_q|x^q|$ the energy levels derived from the quantum theory lead to
expressions for the thermodynamic functions which agree with the predictions of classical
theory at high temperatures.


Introduction 

The energy levels and wave functions of a one-dimensional oscillator have been
derived for a number of different potential energy functions, the simplest of which
is the harmonic oscillator with $V(x) = ax^2$. All these functions resemble the harmonic
oscillator in that the plot of $V(x)$ against $x$ has a finite radius of curvature at the
equilibrium point $x = 0$, i.e. if $V(x)$ is expanded in powers of $x$ near $x = 0$, the
coefficient of $x^2$ is not zero. Similarly, in the normal co-ordinate treatment of
molecular vibrations it is always assumed in the first instance that the potential energy
is a quadratic function of the displacements, further terms being introduced only as
corrections to the quadratic form. It will be shown in the present paper that there
may be some types of molecular vibration for which these methods of treatment
are inadequate, since for small displacements the potential energy is proportional
to the fourth power of the displacement. In these cases if $V(x)$ is expanded in powers
of $x$ near $x = 0$, the coefficient of $x^2$ is zero, and the plot of $V(x)$ against $x$ has an
infinite radius of curvature at $x = 0$. It is therefore of interest to investigate the
properties of a one-dimensional oscillator in which $V(x) = ax^4$, and a vibration of
this type will be referred to as a fourth-power vibration.


Vibrations of plane rings

The kind of vibration described above may occur in molecules containing a plane 
ring of four or more atoms, which may have other atoms or groups attached to them. 
This can be illustrated by considering the ring ABCD in figure 1. In one of its modes
of vibration the atoms will move initially at right angles to the plane of the ring, 
as shown by the arrows. It is easily seen from the geometry of the system that a 
first-order infinitesimal displacement x produces only a second-order change in the 
distances between any pair of atoms, or in the angles between any pair of bonds 
attached to the same atom. If therefore the forces in the ring can be represented by 
either a valency-force or a central-force system, the expression for the potential 
energy will contain no term in $x^2$, and if the usual normal co-ordinate treatment is
carried out one frequency will turn out to be zero. 
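
The geometrical statement above can be checked numerically. The sketch below is only an illustration under stated assumptions: the ring is taken as a square of side `b`, and alternate atoms are assumed to move by `+x` and `-x` at right angles to the plane (figure 1 is not reproduced here, so this pattern of arrows is an assumption). The function name `bond_length_change` is hypothetical.

```python
import math

def bond_length_change(x, b=1.0):
    """Change in the A-B distance of a square ring of side b when the two
    atoms move by +x and -x at right angles to the ring plane (assumed mode)."""
    return math.sqrt(b * b + (2.0 * x) ** 2) - b

# Doubling a small displacement should multiply the change by about four,
# which is the signature of a second-order (quadratic) dependence on x.
for x in (0.01, 0.02, 0.04):
    print(f"x = {x:.2f}   change in bond length = {bond_length_change(x):.3e}")
```

Since the change in each distance is quadratic in the displacement, an energy that is quadratic in those changes is quartic in the displacement, as the text goes on to argue.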



A somewhat similar position is met with for the out-of-plane bending vibration
of the planar molecule $XY_3$, which is assigned a zero frequency by the ordinary
valency-force or central-force treatment. However, in this case, although there is
no first-order change in single valency-angles, there is a first-order distortion of the
disposition of valencies round the atom $X$, which may be expressed in terms of the
angle between the bond $XY$ and the plane $Y_3$. It is therefore not surprising that for
boron trichloride the force constant for the out-of-plane bending is approximately
the same as for bending in the plane of the molecule (Anderson, Lassettre & Yost
1936). On the other hand, in the vibration represented in figure 1 the valencies
attached to any given atom suffer no first-order displacement of any kind relative
to one another, and this remains true also when there are other atoms or groups
attached to any of the atoms ABCD.

The nature of the forces which resist distortion from a plane ring-configuration 
will vary from one case to another, and will be related to the path followed by the 
atoms in a finite displacement. Two extreme cases may be distinguished. In the 
first case, if the forces between unlinked atoms can be neglected, then the distances 
between adjacent (linked) atoms will remain unchanged, the atoms will describe 
circles (e.g. a, a in figure 1), and the restoring force will be due to distortions of 
valency angles. In the second case, if the diagonal distances AC and BD are fixed 
by strong forces, the atoms will continue to move in straight lines, and there will be
a change both in the valency angles and in the bond distances AB, BC, CD and DA.
In either case it is clear from the geometry of the system that for a small displacement
the changes in distances and angles are all proportional to the square of the
displacement, and hence that the change in potential energy is proportional to the
fourth power of the displacement. In an actual system the motion would represent
a compromise between these two extreme cases, resembling the first case more
closely, since the forces between linked atoms will normally be greater than those
between unlinked ones. The potential energy will also follow a fourth-power law
in this intermediate case, and the constant $a$ in $V(x) = ax^4$ can be expressed in
terms of the ordinary quadratic force constants of a central-force or valency-force
system.

It has been tacitly assumed so far that there is no restriction on the relative 
directions of bonds attached to different atoms, i.e. that there is free rotation about 
the bonds in the ring. If rotation is not free there may be a quadratic term in the 
potential energy even when there are no groups attached to the ring atoms: for 
example, in the motion represented in figure 1 the bonds AD and BC are twisting 
about AB. We can therefore exclude from consideration here plane rings
containing bonds of higher order than single bonds (e.g. benzene), since in these there is
undoubtedly restricted rotation about the ring bonds. In the cycloparaffin series
rings containing six or more atoms are non-planar, so that the present treatment
applies directly only to cyclobutane and its derivatives, and possibly also to
cyclopentane, though there is some evidence (Aston, Fink & Schumann 1943) that the
latter may not be planar. There are also many compounds of formula $X_2Y_6$ which
contain a planar four-ring of single bonds: thus dimeric aluminium chloride can be
written

[Structural formula of dimeric aluminium chloride, Al$_2$Cl$_6$, not reproduced.]

and there is direct evidence that this configuration is correct for the halides of
aluminium, indium and gallium (Palmer & Elliott 1938; Brode 1940). The same
structure probably applies to other dimeric halides such as Fe$_2$Cl$_6$ and Au$_2$Cl$_6$, and
it also occurs in more complex co-ordinated halogen compounds (Mann 1938).
Moreover, it has been recently shown (Longuet-Higgins & Bell 1943) that the hydrides
B$_2$H$_6$ and Ga$_2$H$_6$ probably have the bridged structure

[Structural formula of the bridged hydride not reproduced.]

and that the same type of bridge linking occurs in many of their derivatives.
There are thus many examples of planar four-rings containing no multiple bonds,
which are of interest in the present context.

There is probably always some resistance to rotation even about single bonds,
which may introduce an energy term proportional to $x^2$ in the type of vibration
considered above. Experimental and theoretical knowledge on this question is at
present very incomplete, but the available data suggest that hindered rotation is
unlikely to be the decisive factor in the present problem, though it is likely to
produce a correction to the energy expression $V(x) = ax^4$ for relevant values of the
displacement. In any case a study of the ring vibrations should give information
about the actual form of the potential-energy curve, and hence about the extent
to which rotation is restricted. It should be noted that the existence of an appreciable
hindrance to rotation will in general modify the normal strain-free equilibrium
configuration of the ring, e.g. cyclobutane would not be exactly planar. However,
electron diffraction measurements on methylenecyclobutane (Bauer & Beach 1942)
indicate a planar configuration for this compound, and the vibrational spectrum of
cyclobutane itself (Wilson 1943) does not admit of a non-planar structure: hence it
is probable that resistance to rotation is small in this compound.


Energy levels and wave functions


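If the potential energy is given by $V(x) = ax^4$, the wave equation is

$$\frac{d^2\psi}{dx^2} + \frac{8\pi^2 m}{h^2}\,(W - ax^4)\,\psi = 0, \tag{1}$$

which on making the substitutions

$$\xi = \left(\frac{8\pi^2 ma}{h^2}\right)^{1/6} x, \qquad \lambda = 4W\left(\frac{\pi^4 m^2}{h^4 a}\right)^{1/3} \tag{2}$$

becomes

$$\frac{d^2\psi}{d\xi^2} + (\lambda - \xi^4)\,\psi = 0. \tag{3}$$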


Equation (3) is not soluble by known functions or in series, and the general equations
given by Dunham (1932b) for energy levels are not applicable in the absence of a
term in $\xi^2$. The numerical difference-equation method of Kimball & Shortley (1934)
gives both eigen-values and eigen-functions, and we have applied it to the first four
levels of equation (3), using a difference $\Delta\xi = 0.2$. Starting with a trial function of
roughly the right form, satisfactory convergence is obtained after 4-6 approximations.
The eigen-values of $\lambda$ thus obtained are given in column (a) of table 1. They
have an estimated accuracy of about 2 %. Figure 2 shows the first four wave functions
obtained by this method, which are qualitatively similar to
those for a harmonic oscillator. The difference is illustrated by the broken curve,
which is the wave function of the lowest state of a harmonic oscillator having the
same energy as the lowest state of (1).
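
As a rough independent check on the eigen-values, equation (3) in the reduced form $\psi'' = (\xi^4 - \lambda)\psi$ can be integrated numerically. The sketch below does not reproduce the Kimball & Shortley difference-equation procedure; it swaps in a simple shooting method (fourth-order Runge-Kutta integration from $\xi = 0$ with bisection on $\lambda$). The cut-off `xi_max`, the step count and the starting brackets are illustrative assumptions, and the function names are hypothetical.

```python
def psi_at_edge(lam, odd, xi_max=4.0, steps=2000):
    """Integrate psi'' = (xi**4 - lam) * psi from xi = 0 to xi_max (RK4).
    Even states start from psi = 1, psi' = 0; odd states from psi = 0, psi' = 1."""
    h = xi_max / steps
    psi, dpsi = (0.0, 1.0) if odd else (1.0, 0.0)
    xi = 0.0
    for _ in range(steps):
        q0 = xi ** 4 - lam
        qm = (xi + 0.5 * h) ** 4 - lam
        q1 = (xi + h) ** 4 - lam
        k1p, k1d = dpsi, q0 * psi
        k2p, k2d = dpsi + 0.5 * h * k1d, qm * (psi + 0.5 * h * k1p)
        k3p, k3d = dpsi + 0.5 * h * k2d, qm * (psi + 0.5 * h * k2p)
        k4p, k4d = dpsi + h * k3d, q1 * (psi + h * k3p)
        psi += h * (k1p + 2 * k2p + 2 * k3p + k4p) / 6.0
        dpsi += h * (k1d + 2 * k2d + 2 * k3d + k4d) / 6.0
        xi += h
    return psi

def eigenvalue(lo, hi, odd, iterations=40):
    """Bisect on lam: psi(xi_max) changes sign as lam passes an eigen-value."""
    f_lo = psi_at_edge(lo, odd)
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        f_mid = psi_at_edge(mid, odd)
        if (f_mid > 0) == (f_lo > 0):
            lo, f_lo = mid, f_mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

# Assumed brackets (lower bound, upper bound, odd parity?) for the first four levels.
BRACKETS = [(0.5, 2.0, False), (3.0, 5.0, True), (6.0, 9.0, False), (10.0, 13.0, True)]
levels = [eigenvalue(lo, hi, odd) for lo, hi, odd in BRACKETS]
for n, lam in enumerate(levels):
    print(f"lambda_{n} = {lam:.3f}")
```

Under these assumptions the two lowest values should fall close to those quoted from Milne in table 1, and the next two within a few per cent of column (a).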



Figure 2 


Table 1. Eigen-values of equation (3)

                 (a)       (b)       (c)       (d)
$\lambda_0$      1.09      0.99      0.87      1.061
$\lambda_1$      3.81      3.82      3.76      3.800
$\lambda_2$      7.36      7.52      7.41      -
$\lambda_3$     11.40     11.66     11.61      -
$\lambda_4$      -        16.27     16.23      -


The energy levels thus obtained can be checked by the Brillouin-Wentzel-Kramers
(B.W.K.) method of approximation (Dunham 1932a). For any potential-energy
function $V(x)$ the first two terms of the expansion are




Putting $V(x) = ax^4$ and $y = V(x)/W_n$, and using (2) it follows that

(2n+l)7r = iAU-fA^4 

h = = immmi) =ms, 

k = j(i-y)-*y*dy = 4T(i) A-t)/Ai) = 2-636. 


(5) 

( 6 ) 


The values of $\lambda_n$ calculated from equation (5) are given in column (b) of table 1. The
agreement with column (a) is satisfactory except for the lowest level, which is just
where the B.W.K. method is expected to be least reliable. A still simpler expression
for the eigen-values of $\lambda$ is obtained by omitting the last term of equations (4) and

a.-{»*± 3}*. m 


The values calculated from equation (7) are given in column (c) of table 1: except
for the lowest level they are sufficiently accurate for most purposes. A further check
on $\lambda_0$ and $\lambda_1$ is provided by the work of Milne (1930), who used a laborious but accurate
numerical method, giving the values in column (d) of table 1. It may be noted that
the addition of a third term to the B.W.K. expression (4) gives $\lambda_0 = 1.16$, which
differs more from the accurate value 1.061 than does the second approximation 0.99.
This is consistent with the fact that the B.W.K. method involves an asymptotic and
not a convergent expansion, and it is thus doubtful whether the inclusion of the
second term in equation (4) produces in general an improvement on equation (7).
The early divergence of the asymptotic series in the case $V(x) = ax^4$ can be related
to the proximity of two imaginary zeros of the expression $W - V(x)$ (Kemble
1935). In subsequent calculations values in column (d) have been used for $\lambda_0$ and $\lambda_1$,
those in column (a) for $\lambda_2$ and $\lambda_3$, and column (c) for $\lambda_4$ and higher levels.
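
The first-order values can be reproduced with a few lines of arithmetic. The sketch below assumes the usual first-order B.W.K. condition for equation (3), namely that the integral of $\sqrt{\lambda - \xi^4}$ round the classical orbit equals $(2n+1)\pi$; with $y = \xi^4/\lambda$ this gives $\lambda_n^{3/4} I_1 = (2n+1)\pi$, where $I_1 = \int_0^1 (1-y)^{1/2} y^{-3/4}\,dy$ is a beta function. The name `lam_first_order` is hypothetical.

```python
import math

# I1 = integral from 0 to 1 of (1 - y)**(1/2) * y**(-3/4) dy = B(1/4, 3/2)
I1 = math.gamma(0.25) * math.gamma(1.5) / math.gamma(1.75)

def lam_first_order(n):
    """First-order B.W.K. eigen-value of psi'' + (lam - xi**4) psi = 0 (assumed form)."""
    return ((2 * n + 1) * math.pi / I1) ** (4.0 / 3.0)

for n in range(5):
    print(n, round(lam_first_order(n), 2))
```

On this assumption the values agree with column (c) of table 1 to within about 0.01.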

It is interesting to note the change in the sequence of energy levels for the general
symmetrical potential well $V(x) \propto |x^q|$ as the value of $q$ is changed. Some of these
values are given in table 2, the energy of the lowest level being taken as unity in
each case.

Table 2

potential-energy function                  energy levels

$V(x) \propto |x|$                    1     2.29     3.18     3.98     4.72
$V(x) \propto x^2$                    1     3        5        7        9
$V(x) \propto x^4$                    1     3.58     6.99    10.95    15.31
$V(x) = 0$ for $|x| < a$,
$V(x) = \infty$ for $|x| > a$         1     4        9       16       25


The values in the first row have been derived elsewhere (Bell 1944), while the last
row represents a rectangular potential box, corresponding to $q \to \infty$. These results
may be compared with expressions previously derived for the permeability of a
potential barrier with $V(x) = V_0(1 - a_q|x^q|)$, which show a similar gradation (Bell
1937). The asymptotic formula (15) shows that the gradation persists in the higher
levels.



Selection rules and spectrum

In the absence of explicit wave functions it is not possible to derive general selection
rules for fourth-power vibrations. However, it is clear from symmetry considerations
that the matrix element $\int_{-\infty}^{+\infty} \psi_m\psi_n x\,dx$ is zero when $m$ and $n$ are either
both odd or both even, so that dipole transitions to or from the ground state $\psi_0$ can
involve only odd upper states. Numerical calculation with the wave functions derived
above gives $\int\psi_0\psi_1\xi\,d\xi = 1.41$ and $\int\psi_0\psi_3\xi\,d\xi = 0.015$. The latter value does not
differ significantly from zero, which would suggest that the selection rule for dipole
radiation is $n \to n \pm 1$, just as for the harmonic oscillator. However, it can be shown
from the form of equation (3) that non-zero values of $\int\psi_m\psi_n\xi\,d\xi$ cannot always be
restricted to $m = n \pm 1$,* and it seems likely that all transitions are allowed provided
that $m$ and $n$ differ by an odd integer.

If the frequency for the transition 0 → 1 is taken as unity, then the frequencies of
the overtones 0 → 1, 0 → 2, 0 → 3, 0 → 4, etc., are in the ratio 1, 2.32, 3.85, 5.54, etc.,
compared with 1, 2, 3, 4, etc., for a harmonic oscillator. Moreover, the transitions
1 → 2, 2 → 3, 3 → 4, 4 → 5, etc., will contribute other possible overtones with frequencies
1.32, 1.53, 1.69, 1.82, etc. The spectrum of a fourth-power oscillation is thus likely
to contain more overtones than that of a harmonic vibration of comparable frequency.
In terms of the constant $a$ the lowest spectrum frequency $\nu_0$ is given by

(8 ) 
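
The relative frequencies quoted above follow directly from differences of the eigen-values $\lambda_n$. The sketch below is an illustration only: it assumes the values 1.061 and 3.800 of column (d) of table 1 for the two lowest levels and, as a simplifying assumption, first-order B.W.K. values for all higher levels. The names `level` and `relative_frequency` are hypothetical.

```python
import math

I1 = math.gamma(0.25) * math.gamma(1.5) / math.gamma(1.75)  # beta function B(1/4, 3/2)

def level(n):
    """Eigen-value lambda_n: tabulated for n = 0, 1; first-order B.W.K. above (assumption)."""
    accurate = {0: 1.061, 1: 3.800}  # column (d) of table 1
    if n in accurate:
        return accurate[n]
    return ((2 * n + 1) * math.pi / I1) ** (4.0 / 3.0)

def relative_frequency(lower, upper):
    """Frequency of the transition lower -> upper, in units of the 0 -> 1 frequency."""
    return (level(upper) - level(lower)) / (level(1) - level(0))

from_ground = [relative_frequency(0, n) for n in range(1, 5)]
adjacent = [relative_frequency(n, n + 1) for n in range(1, 5)]
print([round(f, 2) for f in from_ground])
print([round(f, 2) for f in adjacent])
```

With these inputs the two lists come out close to the ratios given in the text for transitions from the ground state and between adjacent excited levels.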

It should be noted that this frequency is not identical (as for the harmonic oscillator)
with the vibration frequency assigned by classical mechanics. This mechanical
frequency $\nu'$ depends on the energy $W$, and is easily shown to be

$$\nu' = \frac{2\Gamma(\tfrac34)}{\sqrt{2m\pi}\,\Gamma(\tfrac14)}\,(Wa)^{1/4}. \tag{9}$$

For an energy $W_0$ corresponding to the lowest quantum state this becomes

$$\nu'_0 = \frac{\lambda_0^{1/4}\,\Gamma(\tfrac34)}{\pi^{5/6}\,\Gamma(\tfrac14)}\left(\frac{ha}{m^2}\right)^{1/3} = 0.133\left(\frac{ha}{m^2}\right)^{1/3} = 0.81\,\nu_0, \tag{10}$$

i.e. the fundamental spectrum frequency is somewhat higher than the mechanical
frequency of the system in its lowest state. According to the correspondence
principle, as the quantum number $n$ increases, the spectrum frequency for the transition
$n \to n + 1$ will approach more and more closely to the mechanical frequency of the
system, and it is easily shown directly from equations (7) and (9) that this is so in
the present case.
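
The approach to the correspondence limit can be illustrated numerically. The sketch below assumes that equation (2) gives $W = \tfrac14\lambda\,(h^4 a/\pi^4 m^2)^{1/3}$, that the mechanical frequency is $\nu' = 2\Gamma(\tfrac34)(Wa)^{1/4}/[\sqrt{2m\pi}\,\Gamma(\tfrac14)]$, and that the levels are given by the first-order B.W.K. expression. On these assumptions the constants $h$, $a$ and $m$ cancel from the ratio of the two frequencies. The function names are hypothetical.

```python
import math

I1 = math.gamma(0.25) * math.gamma(1.5) / math.gamma(1.75)
GAMMA_RATIO = math.gamma(0.75) / math.gamma(0.25)

def lam(n):
    """First-order B.W.K. eigen-value (assumed form of the asymptotic levels)."""
    return ((2 * n + 1) * math.pi / I1) ** (4.0 / 3.0)

def spectrum_over_mechanical(n):
    """Ratio of the n -> n+1 spectrum frequency to the mechanical frequency at W_n."""
    spacing = lam(n + 1) - lam(n)
    return spacing / (4.0 * math.sqrt(math.pi) * GAMMA_RATIO * lam(n) ** 0.25)

for n in (1, 10, 100, 1000):
    print(n, round(spectrum_over_mechanical(n), 4))
```

The ratio falls steadily towards unity as $n$ increases, which is the behaviour the correspondence principle requires.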

* The author is indebted to Professor E. C. Titchmarsh for pointing this out.




It is of interest to determine the approximate frequency of fourth-power vibrations
in actual molecules, as a guide to their experimental detection. To do this the
simple case of a square ring of four identical atoms is considered, assuming that
forces between unlinked atoms can be neglected, and that the force constants for
bond stretching are very large compared with those for valency bending. There are
then two types of vibration which the ring can execute (without altering the bond
lengths): (a) vibration in the plane of the ring, the atoms moving along the diagonals
of the square, and (b) vibration at right angles to the plane of the ring, the atoms
moving along the circles indicated in figure 1. For small displacements (a) is a
simple harmonic vibration, while (b) is a fourth-power vibration. If $\Delta$ is the
displacement of each atom, $b$ the bond length, and $d$ the ordinary bending force constant in
dynes/cm., then the potential energy of the whole ring in the two cases is given by

F 0 = 4.1.8 dAl, = (11) 

Using the ordinary harmonic oscillator expression and equation (8), one finds for
the lowest spectrum frequency in the two cases 



and hence for the ratio of the two frequencies 



On putting reasonable values for $m$, $d$ and $b$ into equation (13), we find $\nu_b/\nu_a \approx 0.7$-$0.2$.
This value applies directly only to the simple model which is taken here, but it should
be of the right order of magnitude in more complicated cases. Since the harmonic
bending frequencies $\nu_a$ lie in the range 150-1500 cm.$^{-1}$ (according to the mass of
the atoms), the fourth-power frequencies should be frequently detectable in the
infra-red or Raman spectrum.

The molecule which approaches the model most closely is cyclobutane. If each
CH$_2$ group can be considered to move as a whole, and interaction between unlinked
atoms is neglected, then the second part of equation (12) should give an accurate
value for the fourth-power frequency $\nu_b$. Wilson (1943) has recently given an
experimental and theoretical study of the vibrational spectrum of cyclobutane,
obtaining a value of 3.28 x 10 * dynes/cm. for the CCC bending force constant of
the ring. With this value equation (12) gives $\nu_b = 120$ cm.$^{-1}$. This frequency is too
low to be observed directly in the infra-red spectrum, but on the basis of observed
combination tones Wilson has assigned the value 145 cm.$^{-1}$ to the out-of-plane
bending frequency of the ring (which he regards as harmonic), which is somewhat
larger than the figure calculated here. The discrepancy of 20 % suggests that
restricted rotation is of appreciable but not decisive importance in treating this type
of vibration. In the case of diborane the vibration spectrum has not yet been
analysed on the basis of the bridge structure, but the value of $\nu_b$ can be estimated
from equation (13). $\nu_a$ appears in the Raman spectrum as 793 cm.$^{-1}$, being identified
by the boron isotope effect (Anderson & Burg 1938), while electron diffraction data
give $b = 1.39$ A (Longuet-Higgins & Bell 1943). Then from equation (13) it is found
that $\nu_b \approx 450$ cm.$^{-1}$, and there is in fact a strong infra-red band at 412 cm.$^{-1}$ which is
otherwise difficult to explain (Stitt 1941). Close numerical agreement with equation
(13) would not be expected, since it is certainly unjustifiable to neglect the
interaction between unlinked boron atoms in calculating $\nu_a$ and $\nu_b$.

Further identification of these frequencies as fourth-power vibrations might
occur through their overtones, or through the thermal properties dealt with in the
next section. However, the most direct verification would be by means of the
isotope effect, since equation (8) shows that $\nu \propto m^{-2/3}$, instead of the usual $\nu \propto m^{-1/2}$.
This test should be particularly valuable for diborane and similar compounds,
where hydrogen atoms form part of the ring. In other cases the isotope effect can be
approximately simulated by attaching groups to the ring, e.g. by comparing
cyclobutane with alkyl-cyclobutanes. Since the fourth-power vibrations have a
considerably lower frequency than the other vibrations of the system, the primary effect of
substitution is merely to increase the mass of the units composing the ring.
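
The size of the isotope effect can be illustrated with a short calculation. The sketch below assumes the mass dependences $m^{-2/3}$ for a fourth-power vibration (as follows from the scaling of the energy levels with $m$) and $m^{-1/2}$ for a harmonic vibration, and takes a doubling of the vibrating mass as a purely illustrative case. The names used are hypothetical.

```python
HARMONIC_EXPONENT = -1.0 / 2.0       # nu proportional to m**(-1/2)
FOURTH_POWER_EXPONENT = -2.0 / 3.0   # nu proportional to m**(-2/3)

def frequency_ratio(mass_ratio, exponent):
    """Factor by which the frequency changes when the mass is multiplied by mass_ratio."""
    return mass_ratio ** exponent

mass_ratio = 2.0  # illustrative: the vibrating mass is doubled
print("harmonic     :", round(frequency_ratio(mass_ratio, HARMONIC_EXPONENT), 3))
print("fourth-power :", round(frequency_ratio(mass_ratio, FOURTH_POWER_EXPONENT), 3))
```

For a doubled mass the two laws give shifts that differ by roughly a tenth of the original frequency, which is why the isotope effect is suggested as a discriminating test.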

In dealing with the spectrum and with the thermal properties, the fourth-power 
vibration is being treated as if it were one of the normal vibrations of the system, 
and thus qualified to make an independent contribution to the kinetic and potential 
energy of the system. This is not strictly legitimate, since the whole theory of normal 
vibrations depends on the assumption of harmonic vibrations of small amplitude. 
However, on account of the low frequency of the fourth-power vibration, it seems 
physically reasonable to treat it independently of the other modes of vibration, at 
least as a first approximation. 


Thermodynamic properties

The contribution made by a fourth-power vibration to the various thermodynamic
functions can be evaluated by the standard methods. For high temperatures
equation (7) can be used in conjunction with the Euler-Maclaurin summation
formula, but for lower temperatures it is necessary to use the more exact energy
levels and to sum term by term. For a given fundamental spectrum frequency $\nu_0$,
the contribution of a fourth-power vibration to any thermodynamic function will
differ from that of a harmonic vibration. The extent of the deviation is illustrated
by table 3, which gives the molar specific heat as a function of $h\nu_0/kT$.

Table 3. Specific heat contributions
(a) = harmonic vibrations; (b) = fourth-power vibrations

$h\nu_0/kT$     0        1.0      1.5      2.0      3.0      4.5      6.0
$c_v/R$ (a)     1.000    0.921    0.832    0.724    0.496    0.230    0.0898
$c_v/R$ (b)     0.750    0.802    0.685    0.620    0.460    0.219    0.0897

It will be seen that at low temperatures there is little difference between the two types of
oscillator, corresponding to the fact that only the two lowest levels are effective in
determining the specific heat. At higher temperatures the difference is considerable,
and should be detectable experimentally. At intermediate temperatures the value
of $c_v/R$ for the fourth-power vibration rises slightly above the limiting
high-temperature value of 3/4. This limiting value is in accordance with the classical
equipartition theory, according to which any form of energy proportional to the $q$th
power of a co-ordinate or a momentum contributes $RT/q$ to the total energy per
mole: in the present instance the kinetic energy contributes $\tfrac12 RT$ and the potential
energy $\tfrac14 RT$.
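
The term-by-term summation mentioned above can be sketched as follows. The code assumes energies measured from the lowest level in units of $h\nu_0$ and writes $u = h\nu_0/kT$; the specific heat is then $u^2$ times the variance of the energy over the Boltzmann distribution. Only the harmonic case is evaluated here, where the closed (Einstein) form is available for comparison; the function names are hypothetical.

```python
import math

def heat_capacity_from_levels(energies, u):
    """c/R from a list of level energies (in units of h*nu0), with u = h*nu0/(k*T)."""
    weights = [math.exp(-u * e) for e in energies]
    z = sum(weights)
    mean = sum(e * w for e, w in zip(energies, weights)) / z
    mean_sq = sum(e * e * w for e, w in zip(energies, weights)) / z
    return u * u * (mean_sq - mean * mean)

def harmonic_heat_capacity(u):
    """Closed form of c/R for a harmonic vibration (Einstein function)."""
    return u * u * math.exp(u) / (math.exp(u) - 1.0) ** 2

harmonic_levels = list(range(200))  # equally spaced levels 0, 1, 2, ...
for u in (1.0, 1.5, 2.0, 3.0, 4.5, 6.0):
    print(u, round(heat_capacity_from_levels(harmonic_levels, u), 3))
```

The same summation can be applied to any other set of levels, for instance those of a fourth-power vibration, by replacing `harmonic_levels`.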

It is interesting in this connexion to consider the more general oscillator
represented by

$$V(x) = a_q\,|x^q|, \tag{14}$$

where $q$ can have any positive value. For high quantum numbers the energies are
given by the first term of equation (4), which reduces to

$$W_n^{1/q + 1/2} = \frac{(n + \tfrac12)\,h\,q\,a_q^{1/q}\,\Gamma(1/q + \tfrac32)}{2\sqrt{2m\pi}\;\Gamma(1/q)}. \tag{15}$$

At high temperatures $n$ can be treated as a continuous variable and the summations
in the partition function replaced by integrations. Putting $t = W_n/kT$ this
gives for the mean energy per mole

$$RT\,\frac{\int_0^\infty t^{1/q + 1/2}\,e^{-t}\,dt}{\int_0^\infty t^{1/q - 1/2}\,e^{-t}\,dt} = \left(\frac{1}{q} + \frac{1}{2}\right) RT, \tag{16}$$

in agreement with the classical result.
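
The classical result can be checked by direct numerical averaging. The sketch below evaluates the Boltzmann average of a potential energy $V = a|x|^q$ by midpoint quadrature; the cut-off of the integration range and the number of points are illustrative assumptions, and the function name is hypothetical. Adding the kinetic contribution of one half then gives the total in units of $RT$.

```python
import math

def mean_potential_energy(q, kT=1.0, a=1.0, n=20000):
    """Classical Boltzmann average of V = a*|x|**q, in the same units as kT.
    Midpoint rule on 0 <= x <= x_max; the integrand is even in x."""
    x_max = (40.0 * kT / a) ** (1.0 / q)  # beyond this point exp(-V/kT) < exp(-40)
    h = x_max / n
    numerator = 0.0
    denominator = 0.0
    for i in range(n):
        x = (i + 0.5) * h
        v = a * x ** q
        w = math.exp(-v / kT)
        numerator += v * w
        denominator += w
    return numerator / denominator

for q in (1, 2, 4):
    total = 0.5 + mean_potential_energy(q)
    print(f"q = {q}: potential = {mean_potential_energy(q):.4f}, total = {total:.4f}")
```

For $q = 4$ the total is three-quarters of $RT$, the limiting high-temperature value referred to in the discussion of the specific heat.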


References 

Anderson, T. F. & Burg, A. B. 1938 J. Chem. Phys. 6, 586.
Anderson, T. F., Lassettre, E. N. & Yost, D. M. 1936 J. Chem. Phys. 4, 703.
Aston, J. G., Fink, H. L. & Schumann, S. C. 1943 J. Amer. Chem. Soc. 65, 341.
Bauer, S. H. & Beach, J. Y. 1942 J. Amer. Chem. Soc. 64, 1147.
Bell, R. P. 1937 Proc. Roy. Soc. A, 158, 128.
Bell, R. P. 1944 Phil. Mag. (to be published shortly).
Brode, H. 1940 Ann. Phys., Lpz. (5), 37, 344.
Dunham, J. L. 1932a Phys. Rev. 41, 713.
Dunham, J. L. 1932b Phys. Rev. 41, 721.
Kemble, E. C. 1935 Phys. Rev. 48, 549.
Kimball, G. E. & Shortley, G. H. 1934 Phys. Rev. 45, 815.
Longuet-Higgins, H. C. & Bell, R. P. 1943 J. Chem. Soc. 250.
Mann, F. G. 1938 Ann. Rep. Chem. Soc. 35, 148.
Milne, W. E. 1930 Phys. Rev. 35, 863.
Palmer, K. J. & Elliott, N. 1938 J. Amer. Chem. Soc. 60, 1852.
Stitt, F. 1941 J. Chem. Phys. 9, 780.
Wilson, T. P. 1943 J. Chem. Phys. 11, 369.



Absorption of radium (B + C) gamma-rays 

By J. E. Roberts, D.Sc., F.Inst.P.

Barnato Joel Laboratories, The Middlesex Hospital 

(Communicated by R. C. Stoner, F.R.S.–Received 12 January 1944)

New measurements of the absorption of filtered gamma-rays from radium (B + C) in alu¬ 
minium, carbon and lead have been made. A small condenser type of ionization chamber 
has been used, which overcomes many of the difficulties usuaUy inherent in this kind of 
measurement. Detailed consideration has been given to the corrections which must be 
applied to ionization measurements before absorption coefficients can be calculated. 

From the most recent theories of absorption by scattering, photoelectric effect, and 
electron pair formation, total absorption coefficients for lead over the wave-length range 
concerned have been calculated. The problem of the effective wave-length of a filtered 
heterogeneous gamma-ray beam is discussed in some detail, and a proposed method of 
estimation put forward. Comparison of the calculations of $\mu/\rho$ with the experimental figures 
indicates that the photoelectric absorption rises more rapidly with increasing wave-length 
than is predicted by theory. 


INTRODUCTION 

A number of attempts have been made to determine experimentally the absorption 
coefficients in lead and other materials of the gamma-rays of radium (B + C), under 
varying degrees of filtration. Although experimental methods do not differ 
fundamentally, the results obtained have not been in very good agreement. The classical 
work on the subject was carried out by Kohlrausch (1917), and his results have been 
quoted and used by most workers. Since that time Ahmad (1924a, 1924b, 1925) 
has made a detailed investigation of the absorption, in a number of materials, of 
gamma-rays of a fixed quality (1 cm. lead filter). The results of these and other 
experimenters have been summarized by Rutherford, Chadwick & Ellis (1930). 

The calculation of absorption coefficients from the experimental data involves 
the evaluation of corrections connected with scattering and with energy absorption 
in the ionization chamber. With the usual experimental arrangements these 
corrections may be large and are difficult to estimate with precision. This leads to some 
uncertainty in the absolute values of the absorption coefficients. In recent years, 
due to a large extent to the development of the medical uses of radium, considerable 
advances have been made in methods of measurement of gamma-rays, and in 
knowledge of the absorption processes. The purpose of the present work has been to 
use these methods in an attempt to eliminate the disturbing factors as far as possible, 
and where this could not be done, to make reasonably accurate allowances for them. 
The experiments have been limited to the gamma-rays of radium (B + C), with 
filtrations up to 5 cm. of lead, using absorbers of lead, aluminium and carbon. 
Having obtained the absorption coefficients, a further attempt is made to analyse 
the results and correlate them with the most recent theories of absorption. 



Apparatus 
General arrangement 

The general arrangement of the apparatus for absorption measurements follows 
the principles laid down by Kohlrausch (1917). A beam of gamma-rays from a radium 
source R (figure 1) is canalized by means of a series of heavy lead stops before falling 
on the ionization chamber C. With the ionization system described below, a high 
degree of canalization was not required. There is no electrometer or other accessory 
apparatus in the vicinity, and it was only necessary to prevent excessive scattered 
radiation from surrounding objects from reaching the ionization chamber. The stops 
were built up of lead 'bricks', $S_1$ being 8 cm. thick, and $S_2$ and $S_3$ each 4 cm. thick. 
The aperture in $S_1$ was 4 cm. square, and in the other stops 3 cm. square. Filters up 
to 5 cm. thick could be placed between $S_1$ and $S_2$, while $S_2$ and $S_3$ were separated by 
2.5 cm. to take the absorbers. Between $S_3$ and the ionization chamber a heavy lead 
shutter could be interposed. This consisted of a cube of 10 cm. side mounted on a 
smooth running trolley, and could be moved quickly in and out of position by hand. 
The whole system was mounted on a wooden table in the middle of a large room, 
thus further reducing the possible effects of stray scattered radiation. 

A single source with small dimensions and of sufficient strength not being 
available, a composite radium source was used. This was made up of ten platinum tubes 
of the type used in medical work, and had a total radium content of approximately 
100 mg. The tubes were mounted on a small brass plate, the radium being spread over 
an area of about 3 cm. square. The 'cos θ error' introduced into the measurements 
by this dispersal was negligible. The wall thickness of the radium tubes was 1 mm., 
and this set one limit to the qualities of radiation available. For most purposes this 
1 mm. of platinum was regarded as equivalent in absorption to 2 mm. of lead. 

In order to vary the quality of the gamma-ray beam, lead filters were placed in 
position F (figure 1). A series of eight such filters was used, with thicknesses up to 
4 cm., and made up for the most part of sheets of lead 5 cm. square and 0.5 cm. 
thick. Greater filtrations were possible, but the change in radiation quality obtained 
hardly justified the additional work involved. 

The absorbing materials to be investigated, in the form of 5 cm. square blocks, 
were placed in position A (figure 1). The main characteristics of the absorbers are 
shown in table 1.* A density of 11.36 g./cm.³ indicates a high degree of purity in the 
lead; the aluminium was high-quality commercial material; the carbon absorber was 
a block of Acheson graphite. 

Table 1 

absorber      thickness    density     superficial mass 
              cm.          g./cm.³     g./cm.² 
lead I        0.500        11.36       5.67 
lead II       0.167        11.36       1.90 
aluminium     1.931        2.71        5.23 
carbon        2.00         1.58        3.16 

* I am indebted to Mr B. D. H. Watters for accurate measurements of the absorbers. 





Ionization system 

Figure 1. Layout of apparatus. 

Figure 2. Ionization chambers. 

The choice of an ionization system for precision measurements of gamma-rays at 
a large distance from the radium source presents considerable difficulties, largely 
owing to the very small currents involved. With the arrangement shown in figure 1, 
and using a 100 mg. source, the ionization per c.c. of chamber volume amounts to 
only $4 \times 10^5$ ions/sec. without filtration, falling to $2 \times 10^4$/sec. with 4 cm. lead filter. 
This gives currents between $6 \times 10^{-14}$ and $3 \times 10^{-15}$ amp./c.c. of ionized air. To 
overcome these difficulties, it has been the usual practice to use a very sensitive 
measuring instrument, together with an ionization chamber of large volume. The 
latter inevitably introduces difficulties in the interpretation of the results. Some 
method of integrating the ionic current over a period is clearly desirable. 
Developments in recent years of the measurement of gamma-ray dosage for medical purposes 
have shown the very great advantages of the condenser type of ionization chamber 
introduced by Glasser (1928) and Sievert (1932). This consists essentially of a small 
cylindrical or spherical condenser, with a hole in the outer electrode for charging 
and measuring purposes. The inner electrode is charged to a known potential, 
the chamber closed, and then placed in position for irradiation. After irradiation 
for a known time the residual potential on the inner electrode is measured by sharing 
the charge with a calibrated electrometer. So long as the residual potential does not 
fall below the saturation value for the chamber, the difference between the initial 
and final potentials is a measure of the ionization produced in the air volume. No 
measurements are taken during irradiation. There are no leads to be protected 
from stray ionization, and the electrometer can be kept well away from the radium, 
again requiring no protection. A further advantage is that if a wide range of 
intensities has to be investigated, the magnitudes of the measured quantities can be 
kept roughly constant by varying the exposure time. A high degree of insulation 
between the electrodes is of course necessary, as a guard ring cannot be used. 
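
The currents quoted above follow directly from the ion rates. The short sketch below shows the arithmetic, assuming one elementary charge is collected per ion pair and using the modern value of the electronic charge (an assumption; the constant adopted in the original work is not stated).

```python
# Elementary charge in coulombs (modern value, assumed here for illustration).
ELEMENTARY_CHARGE_C = 1.602e-19


def ion_rate_to_current(ions_per_sec_per_cc: float) -> float:
    """Saturation current (amp. per c.c. of air) for a given ion-pair rate."""
    return ions_per_sec_per_cc * ELEMENTARY_CHARGE_C


# Ion rates quoted in the text for a 100 mg. source.
current_unfiltered = ion_rate_to_current(4e5)  # no filtration
current_filtered = ion_rate_to_current(2e4)    # 4 cm. lead filter

print(f"unfiltered: {current_unfiltered:.1e} amp./c.c.")
print(f"4 cm. lead: {current_filtered:.1e} amp./c.c.")
```

Currents of this order are what make an integrating (condenser) chamber attractive in place of a direct current measurement.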

For the experiments here described, two types of condenser chamber were used, 
and they are shown in figure 2. The simplest form (figure 2a) is as described above. 
The outer electrode A consists of a cylinder of Acheson graphite with internal 
dimensions 1.3 cm. diameter and 1.3 cm. long, and wall thickness 0.5 cm. It is 
necessary to use a wall of this thickness (the maximum range of the photo-electrons 
produced by the radiation) in order that the ionization measured should be 
proportional to the incident gamma-ray energy (Mayneord & Roberts 1937). A 5 mm. 
hole in the end of the cylinder for charging purposes is closed by a graphite plug. 
The inner electrode, B, is suspended concentrically with the outer by means of a 
small Distrene (thermoplastic polystyrene) peg. It also is made of graphite, 0.9 cm. 
long and 0.6 cm. diameter, and has a small button of Elektron metal at the lower 
end to prevent damage to the graphite during charging. The procedure of 
measurement was as follows. With the outer electrode earthed, the central electrode is 
brought into contact with an insulated steel needle, raised to about 120 V. This 
initial potential is measured on a precision voltmeter. The chamber is then closed 
and placed in a V-shaped holder in a fixed position in the gamma-ray beam. The 
time of exposure can be adjusted so that whatever the radiation intensity, the 
final voltage of the electrode is in the region of 30 V. In these experiments the 
exposures ranged from 10 min. to 4 hr., the timing errors being extremely small. 
After irradiation the chamber is removed, and the inner electrode again brought 
into contact with the steel needle which is now connected to a Lindemann 
electrometer. The latter is calibrated at a fixed sensitivity, using the same voltmeter as 
for the initial potential. The difference between the initial and final electrode 
potentials is a measure of the total ionic charge collected during the exposure. For most 
experiments an electrometer sensitivity of 30 div./V was sufficient, the capacity 
of the electrometer system being about twelve times that of the chamber. 
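
The charge-sharing step can be illustrated numerically. The sketch below assumes that the electrometer system is uncharged before contact and that its capacity is exactly twelve times that of the chamber (the text says only "about twelve"); the function names are hypothetical and the calculation is not taken from the original work.

```python
CAPACITY_RATIO = 12.0          # electrometer system / chamber (assumed exact)
SENSITIVITY_DIV_PER_V = 30.0   # electrometer sensitivity quoted in the text


def shared_potential(residual_v: float, ratio: float = CAPACITY_RATIO) -> float:
    """Common potential after the chamber shares its charge with an
    initially uncharged electrometer system of `ratio` times its capacity."""
    return residual_v / (1.0 + ratio)


def deflection_div(residual_v: float) -> float:
    """Electrometer deflection (divisions) for a given residual potential."""
    return shared_potential(residual_v) * SENSITIVITY_DIV_PER_V


def residual_from_deflection(divisions: float) -> float:
    """Invert the sharing relation to recover the residual chamber potential."""
    return divisions / SENSITIVITY_DIV_PER_V * (1.0 + CAPACITY_RATIO)


# A residual potential of 30 V, the usual final value in the text.
print(f"deflection for 30 V residual: {deflection_div(30.0):.1f} div.")
```

On these assumptions a residual potential near 30 V gives a deflection of several tens of divisions, which suggests why a sensitivity of 30 div./V was found sufficient.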

In the second type of chamber (figure 2b) a third electrode, C, was introduced. 
The insulating peg now extends deep into electrode B, and is drilled to take a 1 mm. 
thick wire which projects from the back of the chamber. With this arrangement all 
the electrodes are initially earthed, B by contact with the steel needle. On being 
placed in the V holder, electrode C is kept at 120 V with respect to A. The ratio 
of the capacities A to B and B to C is such that there is a potential difference of 
some 80 V across the air space between A and B. On irradiation, electrode B collects 
the ionic charge, and this is then shared with the electrometer. At the time of 
sharing, a compensating voltage, finely adjusted by means of a potentiometer, is 
applied between C and A. A null reading is therefore obtained. The compensating 
voltage, read on a precision voltmeter, gives a direct measure of the integrated 
ionization during the exposure. Using this chamber, much shorter exposures were 
possible, but, for the sake of accuracy in timing, were kept in the region of 10 min. 
No significant differences were found between the results from the two arrangements, 
but somewhat greater reliance has been placed on those obtained with the second 
type of chamber. 

A large number of measurements were made to test the consistency of the 
ionization readings obtained with these chambers. For any one set of conditions, the 
maximum deviation from the mean value rarely exceeded 4 parts in 1000, and the 
final mean values for the ionizations are probably correct to 2 parts in 1000. 

Owing to the very low radiation intensities, maintenance of saturation 
conditions in the chambers did not present any difficulties. To test the saturation, the 
chambers were exposed for increasing periods to the maximum radiation intensity 
used. With chamber a (figure 2a), the measured intensity (in volts loss per minute) 
remained sensibly constant until the residual potential fell to 16 V. In practice it 
was not allowed to fall below 30 V. Chamber b (figure 2b) will become unsaturated 
if the charge collected on electrode B reduces the potential difference between A 
and B too far. The experiments showed no indication of this happening over the 
range considered. At lower radiation intensities saturation will be even more 
complete. 

As some of the series of measurements were spread over a fairly long period of 
time, it was necessary to take into account variations in the density of the air in the 
ionization chamber. All ionization readings were therefore reduced to a standard 
temperature and pressure. An allowance was also made for the slight loss of charge 
from the chamber due to insulation leak and natural ionization. To estimate the 
leak, the chamber was charged to a point midway between the usual initial 
and final potentials, and allowed to stand for periods up to 24 hr. in the absence 
of radium. The loss of potential per hour was then observed, and the appropriate 
correction applied to the ionization readings. For chamber a the usual leak was 
about 0.3 V/hr., so that even on the longest exposures the correction was little 
more than 1 %. Owing to the shorter exposures used with chamber b, the natural 
leak corrections never exceeded 0.2 %, and were usually much less. 
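
The two corrections described above can be sketched as follows. The leak rate, exposure and potentials are those quoted in the text; the reference conditions (0° C., 760 mm. mercury) and the ideal-gas scaling of air density are assumptions made for illustration, since the standard conditions adopted are not stated.

```python
def leak_corrected_loss(observed_loss_v: float, exposure_hr: float,
                        leak_v_per_hr: float) -> float:
    """Potential loss due to radiation alone, after subtracting the
    loss from insulation leak and natural ionization."""
    return observed_loss_v - leak_v_per_hr * exposure_hr


def reduce_to_standard(reading: float, temp_c: float, pressure_mm: float,
                       ref_temp_c: float = 0.0,
                       ref_pressure_mm: float = 760.0) -> float:
    """Scale an ionization reading to standard air density, taking the
    ionization as proportional to density (pressure / absolute temperature)."""
    density_ratio = (pressure_mm / ref_pressure_mm) * (
        (ref_temp_c + 273.15) / (temp_c + 273.15))
    return reading / density_ratio


# Longest exposure in the text: 4 hr., potential falling from 120 V to 30 V.
observed = 120.0 - 30.0
corrected = leak_corrected_loss(observed, exposure_hr=4.0, leak_v_per_hr=0.3)
leak_fraction = (observed - corrected) / observed
print(f"leak correction: {100 * leak_fraction:.1f} % of the observed loss")

# Hypothetical room conditions, for illustration only.
print(f"reading reduced to standard density: "
      f"{reduce_to_standard(corrected, temp_c=18.0, pressure_mm=750.0):.2f} V")
```

The leak term comes out at a little over 1 % of the observed loss for the longest exposure, in line with the statement in the text.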


Determination of absorption coefficient 

The ideal method for the determination of an absorption coefficient is to measure 
the slope at the appropriate point on a logarithmic transmission curve in the material 
considered. In practice, this method presents a number of difficulties, particularly 
with a heavy element such as lead. The quality of the radiation reaching the 
ionization chamber varies considerably with the filtration, and to obtain a true energy 
transmission curve, it is necessary to know precisely the constitution of the beam 
at all stages, and the ionization function of the chamber. Exact correction for the 
amount of scattered radiation reaching the chamber is also difficult if a variable 
absorber thickness is used. Furthermore, there is the purely geometrical difficulty 
of finding the slope of a curve which in some parts is varying only very slowly. In 
view of these objections, the method of Ahmad and others, of measuring the 
percentage transmission through a fixed absorber, of any given radiation beam, has 
been adopted, the coefficient then being determined by direct calculation. In the 
case of lead, it was necessary to use two absorbers. With lightly filtered 
radiation, passage through the 5 mm. lead absorber changed the quality of the beam 
appreciably, and the measured coefficient was found to decrease somewhat with 
increasing absorber thickness. For this reason, in the measurements on more 
heterogeneous beams, the 5 mm. lead absorber was replaced by one of 1.67 mm., 
giving a closer approximation to the true absorption coefficient. 

The choice of absorber thickness is also very closely bound up with the accuracy 
to which the coefficients can be determined. If $x$ is the probable percentage error in 
one ionization measurement, then in the percentage transmission there is a probable 
error of $\sqrt{2}\,x$. Assuming exponential absorption, to a first approximation it is clear 
that the error in the absorption coefficient is $\sqrt{2}\,x/F$, where $F$ is the fraction of 
radiation absorbed. The lead and aluminium absorbers in the present experiments 
were chosen to transmit 70-75 % of the radiation. Thus, assuming the ionization 
measurements are correct to 0.2 %, the values of the absorption coefficients should 
have a probable error of not more than 1½ %. In the case of graphite the 
transmission was 80-85 %, giving an error of some 2½ %. It can also be seen that the use 
of the thin lead absorber, except for very heterogeneous beams, although desirable 
from some points of view could not be justified, owing to the loss in accuracy. It may 
be pointed out here that the accuracy of the ionization measurements does not 
depend to any extent on the radiation intensity. Approximately the same ionic 
charge is measured each time. This is not the case with the more usual direct 
measurement of ionization current. 
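
The error estimate above lends itself to a short numerical sketch. The first function implements the first-approximation rule $\sqrt{2}\,x/F$ stated in the text; the second, which divides by $-\ln T$ instead of $F$, is an addition made here for comparison (it follows from $\mu d = -\ln T$) and is not taken from the original.

```python
import math


def error_first_approx(x_percent: float, transmission: float) -> float:
    """Percentage error in the coefficient from the rule sqrt(2) * x / F,
    where F = 1 - transmission is the fraction absorbed."""
    absorbed = 1.0 - transmission
    return math.sqrt(2.0) * x_percent / absorbed


def error_logarithmic(x_percent: float, transmission: float) -> float:
    """Same estimate with -ln(T) in place of F (an assumed refinement)."""
    return math.sqrt(2.0) * x_percent / (-math.log(transmission))


# Ionization readings taken as correct to 0.2 %, as in the text.
for label, t in (("lead/aluminium, 75 % transmitted", 0.75),
                 ("graphite, 85 % transmitted", 0.85)):
    print(f"{label}: {error_first_approx(0.2, t):.2f} % "
          f"(logarithmic form {error_logarithmic(0.2, t):.2f} %)")
```

The comparison makes the trade-off explicit: a thinner absorber disturbs the beam quality less, but the smaller absorbed fraction inflates the error in the coefficient.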


Absorption coefficients in carbon, aluminium and lead 
Apparent absorption coefficients and corrections 

The apparent mass absorption coefficients ($\mu/\rho$) for carbon and aluminium, as 
calculated directly from the measured ionization currents, with and without the 
absorber in position, are shown in the broken curves of figure 3b. In the case of lead, 
the apparent coefficients, thus calculated, are given in column 2 of table 2. To 
convert these values into energy absorption coefficients in the strict sense, two 
important corrections must be applied. These involve estimations of the scattered 
radiation reaching the chamber from filter and absorber, and of the ionization 
function of the chamber. 







Scattered radiation 

The forward scattering of radiation into the ionization chamber by the absorber 
may lead to appreciable errors in the determination of the transmission through the 
absorber. Kohlrausch partially overcomes this by using large source-absorber- 
chamber distances, but this leads to very low radiation intensities. On the other 
hand, Ahmad, by taking readings at two absorber positions subtending 20° and 80° 
at the ionization chamber, extrapolates the results to a condition of zero scattering. 
Although there are unavoidable uncertainties in this calculation, the results agree 
fairly well with those of Kohlrausch. 

Tarrant (1932) has shown that, with the arrangement of figure 1, the correction 
for scattering is given by 

e/^true — c/^exp. = "h ~~ !)}>' 

where $\alpha = h\nu/mc^2$ and $\theta$ is the maximum angle of scatter from absorber to chamber. 
In order to make an approximate check of this formula, two series of absorption 
measurements were made using different absorber-chamber distances to give 
$\theta$ = 9½° and $\theta$ = 14½°. When corrected in this way for scatter, the values for $\mu/\rho$ 
for the two distances were found to agree within the experimental limits. For the 
purposes of this correction, the approximate effective wave-lengths of the radiations 
were estimated from the aluminium absorption data of Kohlrausch, using the 
Klein-Nishina formula to connect absorption coefficient and wave-length. Most of the 
main experiments were carried out with the absorber subtending a half angle of 9½°, 
though a few results with $\theta$ = 14½° are included. 

The Tarrant correction for scattering takes no account of the absorber thickness, 
which may reduce the amount of its own scattered radiation. For the 5 mm. lead 
and the aluminium absorbers, therefore, a further step in the correction is necessary. 
Tandberg (1937) has shown that the true transmission through a thick absorber is 
given by 

^ = e-Me-A, 

where the correction factor $\beta = (k_\theta/\mu)(1 - e^{-\mu d})$ and $k_\theta$ is the fraction of energy 
scattered between 0° and $\theta$, the angle subtended by the absorber at the chamber. 
This formula reduces to that of Tarrant at small absorber thicknesses. According to 
Tarrant's calculation from the Klein-Nishina formula, the value of $k_\theta$ per electron 
is given by 

vr <2 4 

To assess the value of $\beta$, it is necessary to assume approximate values for $\mu$, and the 
experimental figures, as corrected by the Tarrant formula, were used. As is to be 
expected, the correction from the Tandberg formula is somewhat smaller than from 
that of Tarrant. 
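
The behaviour of the Tandberg correction factor can be sketched numerically. Only the expression for $\beta$ is taken from the text; the values of $k_\theta$ and $\mu$ below are hypothetical, and $k_\theta$ is assumed to be expressed per cm. so that it carries the same units as $\mu$.

```python
import math


def tandberg_beta(k_theta: float, mu: float, thickness_cm: float) -> float:
    """Correction factor beta = (k_theta / mu) * (1 - exp(-mu * d))."""
    return (k_theta / mu) * (1.0 - math.exp(-mu * thickness_cm))


# Hypothetical figures for a lead absorber (illustration only).
K_THETA = 0.02   # fraction of energy scattered within theta, per cm. (assumed)
MU = 0.60        # linear absorption coefficient, per cm. (assumed)

for d in (0.01, 0.167, 0.5):
    beta = tandberg_beta(K_THETA, MU, d)
    thin_limit = K_THETA * d   # value of beta when mu * d is very small
    print(f"d = {d:5.3f} cm.: beta = {beta:.5f}, thin-absorber limit = {thin_limit:.5f}")
```

For a very thin absorber $\beta$ tends to $k_\theta d$, which is the sense in which the formula reduces to a thickness-independent correction; for a thick absorber $\beta$ falls below this limit, consistent with the smaller correction noted in the text.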




Some scattered radiation from the filter will also reach the ionization chamber. 
It will, however, form less than 2 % of the beam incident on the absorber, and will 
have, on the average, a wave-length longer than the main beam by about 0.015 Å. 
The effect of this radiation on the quality of the primary beam can clearly be 
neglected. 

Ionization function of chamber 

The estimation of the ionization function of the chamber is perhaps the most 
troublesome of all the aspects of precise gamma-ray measurements. This involves 
the determination of the relationship between measured ionization and the energy 
incident on the chamber. This relationship depends on the absorption in the 
chamber wall and in the enclosed air, which, in turn, depend on the wall material and 
the wave-length of the incident radiation. L. H. Gray (1936) has investigated this 
problem and shown that, in a small air cavity situated in a block of solid material, 
the rate of ionization $I$ in the cavity is related to the energy flux $E$ by 

$$I = \frac{E\,({}_e\sigma_a + {}_e\tau)\,n}{W\rho_e},$$ 

where ${}_e\sigma_a$ and ${}_e\tau$ are electronic scattering and photoelectric absorption coefficients 
for the wall material, 

$n$ is the number of electrons per unit volume of air, 

$W$ is the energy required to produce one ion pair, 

$\rho_e$ is the ratio of the electronic stopping powers of the wall material and the gas. 

This relationship is subject to the conditions that the dimensions of the air cavity 
are small compared to the photoelectron range in air, and that the wall thickness 
is at least equal to the maximum range of electrons produced in the wall material. 
It is clear that the large type of electroscope or large-volume ionization chamber 
commonly used for gamma-ray measurements cannot fulfil these conditions. In 
an analysis of Kohlrausch's experimental results, Sizoo & Williams (1938) have 
assumed that all the ionization was due to air absorption. This is unlikely to be 
correct, as the design of the chamber is such that the lead walls must contribute 
appreciably to the ionization. Ahmad (1924a) attempted to absorb the greater part 
of the radiation in the ionization chamber by introducing iron and lead plates. 
As has been pointed out by Stoner (1929), this does not entirely answer the problem, 
and the calculation of the ionization function is greatly complicated. Stoner (1929) 
has carried out this calculation with reasonably satisfactory results. On the other 
hand, J. A. Gray (1927), using a wood and graphite electroscope, measured the 
absorption of a graphite plate in contact with the chamber wall, and then used the 
coefficient obtained as the ionization function of the electroscope. This approximates 
more nearly to the method suggested by the theory of L. H. Gray. 

The use of a small condenser type of ionization chamber makes it possible to 
fulfil quite closely the requirements of the above equation. The 5 mm. wall 
thickness is greater than the maximum range of the secondary electrons produced in 
graphite, and the linear dimensions of the air volume are of the order of 1 % of the 
average electron range in air. 

L. H. Gray has shown that $W$ and $\rho_e$ are sensibly constant over a wide range of 
wave-lengths. Furthermore, for a graphite chamber and the gamma-rays of radium 
(B + C), photoelectric absorption in the wall is negligible. One can therefore write 

$$I/E = \text{const.} \times {}_e\sigma_a.$$ 

Using the experimental method for the determination of $\mu/\rho$ described earlier, this 
ratio $I/E$ only varies when the mean wave-length of the radiation is changed during 
absorption. In the cases of carbon and aluminium absorbers this change is negligible, 
and no correction of the values of $\mu/\rho$ for ionization function is necessary. For the 
lead absorbers we obtain the percentage energy transmission 

$$\frac{E}{E_0} = \frac{I}{I_0}\,\frac{{}_e\sigma_a^0}{{}_e\sigma_a},$$ 

where ${}_e\sigma_a^0$ corresponds to the radiation before absorption, and ${}_e\sigma_a$ corresponds to 
the radiation after absorption. The effective wave-lengths for the determination of 
${}_e\sigma_a$ were obtained by the method described later. Although ${}_e\sigma_a$ varies only relatively 
slowly with wave-length, the effect of this variation is greatly magnified in the 
calculation of absorption coefficients. For the most lightly filtered radiations, the 
change from ionization to energy ratios may lead to a change of as much as 10 % in 
the value of $\mu/\rho$ for lead. The magnitudes of the scattering and ionization function 
corrections at a number of filtrations are indicated in table 2. The correction for 
scattering in the cases of carbon and aluminium absorbers can be seen from figure 3b. 
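
The magnification of a small change in ${}_e\sigma_a$ can be illustrated as follows. The sketch converts an ionization ratio into an energy ratio and recomputes $\mu/\rho$. The starting coefficient (0.1002, the scatter-corrected value at 0.2 cm. filtration in table 2) and the superficial mass (1.90 g./cm.², the thin lead absorber of table 1) come from the tables; the ratio of ${}_e\sigma_a$ values (1.018) is hypothetical, chosen only for illustration, and the pairing of this absorber with this filtration is likewise an assumption.

```python
import math


def energy_transmission(ion_ratio: float, sigma_a_before: float,
                        sigma_a_after: float) -> float:
    """E/E0 = (I/I0) * (sigma_a before absorption / sigma_a after absorption)."""
    return ion_ratio * sigma_a_before / sigma_a_after


def mass_absorption_coefficient(transmission: float,
                                superficial_mass: float) -> float:
    """mu/rho (cm.^2/g.) from a fractional transmission through an absorber
    of given superficial mass (g./cm.^2), assuming exponential absorption."""
    return -math.log(transmission) / superficial_mass


SUPERFICIAL_MASS = 1.90     # thin lead absorber, table 1
MU_RHO_SCATTER_ONLY = 0.1002  # table 2, 0.2 cm. lead filter

# Ionization ratio implied by the scatter-corrected coefficient.
ion_ratio = math.exp(-MU_RHO_SCATTER_ONLY * SUPERFICIAL_MASS)

# Hypothetical: sigma_a is 1.8 % smaller after the beam has been hardened.
energy_ratio = energy_transmission(ion_ratio, sigma_a_before=1.018,
                                   sigma_a_after=1.000)
mu_rho_corrected = mass_absorption_coefficient(energy_ratio, SUPERFICIAL_MASS)

print(f"mu/rho, scatter only:        {MU_RHO_SCATTER_ONLY:.4f}")
print(f"mu/rho, ionization-corrected: {mu_rho_corrected:.4f}")
```

On these assumptions a change of under 2 % in ${}_e\sigma_a$ alters $\mu/\rho$ by roughly 9 %, because the absorber removes only a small fraction of the beam.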

Table 2. Corrections to absorption coefficients in lead 

filter        $\mu/\rho$        $\mu/\rho$ corrected     $\mu/\rho$ corrected for scatter 
cm. lead      (apparent)    for scatter only     and ionization function 
0.2           0.0985        0.1002               0.0909 
0.7           0.0680        0.0697               0.0658 
1.2           0.0601        0.0650               0.0584 
1.7           0.0559        0.0572               0.0550 
2.2           0.0527        0.0541               0.0519 
3.2           0.0479        0.0511               0.0498 
4.2           0.0470        0.0485               0.0474 


Corrected absorption coefficients 

The finally corrected values for the total mass absorption coefficients ($\mu/\rho$) for 
aluminium, carbon and lead are plotted as a function of lead filter thickness in 
figure 3a, b. In view of the rather different approach to the problem, the results for 
aluminium agree well with those calculated by Rutherford from the data of 
Kohlrausch. The new values of $\mu/\rho$ for lead are appreciably lower than those given by 
Rutherford over the whole range. This is very probably due to a more adequate 
compensation in the present results for chamber ionization function. Lack of this 
compensation will, in general, tend to give high values for $\mu/\rho$. 

Figure 3a. Absorption of gamma-rays in lead (corrected values). Abscissa: filter thickness (cm. lead). 

Figure 3b. Absorption of gamma-rays in carbon and aluminium. Abscissa: filter thickness (cm. lead). 
Broken curves, apparent values; full curves, corrected values. 

Although the values of $\mu/\rho$ for carbon are subject to greater experimental error 
than the rest, a comparison of the results for aluminium and carbon forms a useful 
check on the method. In table 3 are shown the values of the absorption per electron 
(${}_e\mu$) for a number of radiation qualities. The absorption in carbon and aluminium in 
this wave-length range is almost entirely by scatter, and hence the absorption per 
electron should be the same for both elements. Table 3 shows that this is the case to 
well within the experimental error. The small difference between the $\mu/\rho$ curves for 
carbon and aluminium is thus due to the difference in the electronic densities of the 
two materials. 


Table 3. Electronic absorption coefficients in carbon and aluminium 

filter, cm. lead                    0.2      0.5      1.0      2.0      3.0      4.0 
${}_e\mu \times 10^{24}$ carbon        0.216    0.195    0.183    0.171    0.164    0.159 
${}_e\mu \times 10^{24}$ aluminium     0.217    0.197    0.184    0.172    0.165    0.161 
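
The relation between the electronic coefficients of table 3 and the mass coefficients can be sketched as below. The conversion $\mu/\rho = {}_e\mu \, N Z/A$ is standard; the Avogadro number and atomic weights are modern values assumed for illustration, and the coefficients are taken to be in cm.² per electron.

```python
AVOGADRO = 6.022e23   # modern value, assumed for illustration

# (atomic number Z, atomic weight A); modern atomic weights assumed.
ELEMENTS = {"carbon": (6, 12.011), "aluminium": (13, 26.982)}


def mass_coefficient(e_mu_cm2: float, element: str) -> float:
    """Mass absorption coefficient (cm.^2/g.) from the coefficient per electron."""
    z, a = ELEMENTS[element]
    return e_mu_cm2 * AVOGADRO * z / a


# Table 3, 0.2 cm. lead filter.
mu_rho_carbon = mass_coefficient(0.216e-24, "carbon")
mu_rho_aluminium = mass_coefficient(0.217e-24, "aluminium")

print(f"carbon:    {mu_rho_carbon:.4f} cm.^2/g.")
print(f"aluminium: {mu_rho_aluminium:.4f} cm.^2/g.")
```

Although the coefficients per electron are practically equal, carbon has slightly more electrons per gram than aluminium, which accounts for its slightly higher mass coefficient.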


Absorption coefficient and wave-length 
Structure of the absorption coefficient 

As there is no photonuclear absorption in the materials used, the total absorption 
coefficient may be written as 

$$\frac{\mu}{\rho} = \frac{\sigma}{\rho} + \frac{\tau}{\rho} + \frac{\chi}{\rho},$$ 

where $\sigma/\rho$ is the coefficient corresponding to scattering, 

$\tau/\rho$ corresponds to photoelectric absorption, 

$\chi/\rho$ corresponds to electron pair formation. 

Each of these factors has been investigated theoretically by various authors. 
Scattering of high-frequency radiation is well accounted for by the formula of 
Klein & Nishina (1938), from which values of $\sigma$ can be calculated.* The validity of 
the formula over a wide range of wave-lengths has been established by Read & 
Lauritsen (1936) and others using monochromatic X-rays and gamma-rays. The 
most recent theoretical treatment of the problem of photoelectric absorption is that 
of Hulme, McDougall, Buckingham & Fowler (1935). They have considered the 
perturbation of a simple atomic system by a gamma-ray beam, and, with some 
approximations, have obtained curves connecting the photoelectric coefficient $\tau$ 
with the wave-length of the incident beam. Their results are given as graphs of 
r ato ml 2*0 against 6 , where 6 = mc % \hv. For the present purposes the Hulme curve 
for lead has been used to calculate the values of $\tau/\rho$ up to a wave-length of 35 x.u., 
and extrapolated with the aid of L. H. Gray’s (1931) empirical formula up to 50 x.u. 
Absorption by electron pair formation is only significant for the shorter components 
of the gamma-rays of radium (B + C), and forms a small percentage of their total 
absorption. Approximate values of $\chi$ in this region have been obtained from the 
theoretical work of Bethe & Heitler (1934). The results of calculations for lead from 
these three sources are shown for wave-lengths up to 16 x.u. in figure 5a. Further 
figures over a wider wave-length range are given in column 2 of table 5. 

* The writer is indebted to Mr S. B. Osborn, who has recalculated the values of $\sigma$ to a high 
degree of accuracy, using the most recent values of the atomic constants. 


Effective wave-lengths 

Before the experimental results given in figure 3 can be compared with theoretical 
predictions, it is necessary to make some estimate of the effective wave-lengths of 
the gamma-ray beams used. Except under extremely heavy filtrations the 
gamma-rays of radium (B + C) are very heterogeneous, and it does not appear possible to 
determine a single effective wave-length suitable for all purposes. The most obvious 
method, as used by a number of workers, is to make use of the aluminium absorption 
coefficients of the same beams as are used for the measurements on lead. Absorption 
in aluminium in this region being purely by scatter, the scattering per electron (${}_e\sigma$) 
can be derived for each radiation quality from the experimental figures. The 
corresponding wave-lengths are then found from the Klein-Nishina formula. These 
wave-lengths have, however, a limited application. Strictly they should only be used 
when absorption by scatter alone of the heterogeneous beams is being investigated. 
Photoelectric absorption increases much more rapidly with wave-length than 
scatter, and hence with a heterogeneous beam the longer wave-lengths predominate 
in determining the coefficients. This leads to a higher effective wave-length. In the 
case of radium gamma-rays in lead, the problem is more complicated, as both 
scattering and photoelectric absorption play a part. 
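
The "most obvious method" described above, finding the wave-length whose Klein-Nishina cross-section equals the measured scattering per electron, can be sketched as follows. The total Klein-Nishina cross-section is the standard expression; the numerical constants are modern values, 1 x.u. is taken as $10^{-11}$ cm., and the bisection routine is merely one convenient way of inverting the formula, not the procedure of the original work.

```python
import math

R_E_CM = 2.8179e-13   # classical electron radius, cm. (modern value, assumed)
COMPTON_XU = 24.26    # Compton wave-length h/mc in x.u., taking 1 x.u. = 1e-11 cm.


def klein_nishina(alpha: float) -> float:
    """Total Klein-Nishina cross-section per electron (cm.^2),
    with alpha = h*nu / (m*c^2)."""
    log_term = math.log(1.0 + 2.0 * alpha)
    bracket = (
        (1.0 + alpha) / alpha**2
        * (2.0 * (1.0 + alpha) / (1.0 + 2.0 * alpha) - log_term / alpha)
        + log_term / (2.0 * alpha)
        - (1.0 + 3.0 * alpha) / (1.0 + 2.0 * alpha) ** 2
    )
    return 2.0 * math.pi * R_E_CM**2 * bracket


def effective_wavelength_xu(sigma_e: float, lo: float = 0.01,
                            hi: float = 100.0) -> float:
    """Wave-length (x.u.) at which the Klein-Nishina cross-section equals
    sigma_e, found by bisection (the cross-section falls steadily with alpha)."""
    if not klein_nishina(hi) <= sigma_e <= klein_nishina(lo):
        raise ValueError("cross-section outside the range searched")
    for _ in range(200):
        mid = math.sqrt(lo * hi)          # bisect on a logarithmic scale
        if klein_nishina(mid) > sigma_e:
            lo = mid
        else:
            hi = mid
    return COMPTON_XU / math.sqrt(lo * hi)


# Aluminium values per electron from table 3 (0.2 and 4.0 cm. lead filters).
for sigma in (0.217e-24, 0.161e-24):
    print(f"sigma = {sigma:.3e} cm.^2 -> {effective_wavelength_xu(sigma):.1f} x.u.")
```

As the text goes on to note, a wave-length obtained in this way is strictly appropriate only where absorption is by scatter alone.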

In view of these difficulties, an attempt has been made to adapt the method 
indicated by Stoner (1929), and to calculate the effective wave-lengths of filtered 
gamma-rays from the known data on the spectrum of radium (B + C). This spectral 
distribution has been investigated by different methods by Skobelzyn (1927, 1929) 
and Ellis & Aston (1930). 

Using an expansion chamber, Skobelzyn measured the numbers and velocities 
of the recoil electron tracks produced within 20° of a narrow pencil of gamma-rays. 
From each such track the frequency of the original quantum producing it could be 
deduced. Thus as, according to the Klein-Nishina formula, the number of electrons 
ejected within 20° of the gamma-ray beam is practically independent of the 
frequency, the number of tracks corresponding to any one wave-length gives a measure 
of the relative number of quanta of that wave-length in the beam. When Skobelzyn's 
results are plotted as a distribution curve against wave-length, a series of 
well-defined peaks are observed. Most of these can be quite definitely associated with the 
known β-ray spectrum of radium (B + C). It is then a comparatively simple matter 
to separate the peaks and so estimate the relative number of quanta corresponding 
to each wave-length in the gamma-ray spectrum. These are shown in column 3 of 
table 4. To convert these figures into the energy distribution it is only necessary to 
multiply each value of $\Delta N$ by the appropriate $h\nu$. In column 5 of table 4, the energy 
$\Delta E$ of each spectrum line is given as a percentage of the total emission. 




Ellis & Aston (1930) determined, by a photographic method, the relative 
photoelectron emission from platinum for each observable line in the gamma-ray 
spectrum. Their results are given in terms of $N\tau_K$ for each wave-length, where $N$ is the 
relative number of quanta in that wave-length and $\tau_K$ is the corresponding K level 
photo-electric absorption coefficient in platinum. To obtain $N$ it is therefore necessary 
to make some assumption as to the variation of $\tau$ with wave-length, but as a first 
approximation the formula of Hulme, McDougall, Buckingham & Fowler (1935) may 
be used. For the present purpose this is a disadvantage, making the final comparison 
of experimental and theoretical results rather uncertain. In this attempt to 
determine effective wave-lengths, both spectral distributions have been investigated, but 
to avoid confusion, only the Skobelzyn figures are shown in the accompanying tables. 


Table 4. Energy distribution in gamma-ray spectrum 
(filtration, 0.4 cm. lead) 

    λ          hν        number of      hν × ΔN = ΔE     ΔE % 
  (X.U.)    (10^3 V)     tracks, ΔN 

   4.15       2980            2              6.0           1.2 
   4.64       2660            6             16.0           3.1 
   5.10       2420            6             14.5           2.7 
   5.57       2219           14             31.1           6.1 
   5.91       2085           13             27.1           5.3 
   6.94       1778           69            122.7          24.0 
   8.89       1390           65             90.4          17.7 
   9.89       1248           24             30.0           5.9 
  10.92       1130           29             32.8           6.4 
  13.1         941           35             32.9           6.5 
  16.0         773           26             20.1           3.9 
  20.2         612          105             64.3          12.6 
  34.9         354           27              9.6           1.9 
  41.6         297           29              8.6           1.7 
  50.8         243           21              5.1           1.0 
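
The conversion from track counts to an energy distribution described above can be sketched in a few lines. This is only an illustration of the arithmetic, assuming the values of hν and ΔN as read from table 4; the function name `energy_distribution` is introduced here and is not part of the original work.

```python
# Quantum energies (in units of 10^3 V) and track counts as read from table 4.
wavelength_xu = [4.15, 4.64, 5.10, 5.57, 5.91, 6.94, 8.89, 9.89,
                 10.92, 13.1, 16.0, 20.2, 34.9, 41.6, 50.8]
hv_kv = [2980, 2660, 2420, 2219, 2085, 1778, 1390, 1248,
         1130, 941, 773, 612, 354, 297, 243]
tracks = [2, 6, 6, 14, 13, 69, 65, 24, 29, 35, 26, 105, 27, 29, 21]


def energy_distribution(hv, counts):
    """Return (delta_e, percent) for each line.

    delta_e is hv * dN / 1000, the scaling used in column 4 of table 4;
    percent is each line's share of the summed energy.
    """
    delta_e = [e * n / 1000.0 for e, n in zip(hv, counts)]
    total = sum(delta_e)
    percent = [100.0 * d / total for d in delta_e]
    return delta_e, percent


delta_e, percent = energy_distribution(hv_kv, tracks)
for lam, d, p in zip(wavelength_xu, delta_e, percent):
    print(f"{lam:6.2f}  {d:7.1f}  {p:5.1f}")
```

The percentages are normalised to the filtration used by Skobelzyn, so they correspond to the 0.4 cm. lead column of the later tables rather than to the unfiltered radiation.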


In order to obtain the energy distribution for each filter used in the experiments, 
one has to reduce ΔE for each wave-length by the absorption factor (e^{−μx}) 
appropriate to the filter. The coefficients (μ) calculated from theory in the previous section 
have been used. The results of the calculations, giving energy distributions up to 
5 cm. lead filtration, are shown in table 5. Incidentally, it may be pointed out that 
for each filtration, the sum ΣΔE gives the percentage transmission of the composite 
radiation, and a predicted transmission curve can thus be built up. The filtration 
used by Skobelzyn (3.5 mm. lead + 2 mm. glass) is taken as equivalent to 4 mm. lead. 
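
As a rough sketch of this step, each line can be attenuated by exp(−μx) and the lines summed to give one point on the predicted transmission curve. The three lines below are an illustrative selection from table 5 (unfiltered energy and μ for lead, taken here to be in cm.⁻¹); the function names are assumptions made for the example.

```python
import math

# (unfiltered energy dE_0, absorption coefficient mu for lead) for three of
# the lines in table 5; the selection is illustrative only.
LINES = [
    (30.3, 0.534),   # 6.94 X.U.
    (22.7, 0.619),   # 8.89 X.U.
    (21.9, 1.384),   # 20.2 X.U.
]


def filtered_energy(delta_e0, mu, thickness_cm):
    """Energy left in one line behind thickness_cm of lead: dE_0 * exp(-mu * x)."""
    return delta_e0 * math.exp(-mu * thickness_cm)


def transmitted_total(lines, thickness_cm):
    """Sum of the filtered line energies, i.e. one point on the transmission curve."""
    return sum(filtered_energy(e0, mu, thickness_cm) for e0, mu in lines)


for x in (0.0, 0.2, 0.4, 1.0, 1.4, 2.0, 3.0, 4.0, 5.0):
    print(f"{x:3.1f} cm  {transmitted_total(LINES, x):6.2f}")
```

Summing over all fifteen lines instead of three gives the ΣΔE row of table 5, apart from the normalisation to 100 % at 0.4 cm. lead.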

Using the energy distributions shown in table 5, it is now possible to estimate the 
effective wave-length of the radiation for each filtration. The simplest method, as 
suggested by Mayneord (1933), would be to weight each component according to 
the energy in that wave-length. Thus 

$$ \lambda_{\mathrm{eff}} = \frac{\sum \lambda\,\Delta E}{\sum \Delta E}. $$





Gamma-ray absorption 

This takes no account of the type of absorption to which the radiation is to be 
subjected. If the absorber is a light element, and photoelectric absorption is 
negligible, the effect of each energy component must be weighted accordingly. 
From the Klein-Nishina formula it can be shown that, over the wave-length range 
5-50 X.U., the scattering absorption coefficient σ is very nearly proportional to λ^{1/2}. 
Thus, for such an absorber, it is probably more correct to write 

$$ {}_s\lambda_{\mathrm{eff}} = \left(\frac{\sum \lambda^{1/2}\,\Delta E}{\sum \Delta E}\right)^{2}. $$

Effective wave-lengths, calculated in this way, are shown in curve A of figure 4. It 
was found that the wave-lengths in this curve agreed reasonably well with those 
obtained from the experimental absorption data for aluminium and carbon, in 
which all the absorption is by scattering. The wave-lengths thus calculated were used 
in determining the chamber ionization function. 
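
Both weightings are instances of one power-mean: weight λ^n by the energy, normalise, and take the n-th root. The sketch below assumes that form, with n = 1 for the simple energy-weighted mean and n = 1/2 for scattering; the function name and the choice of the 0.4 cm. lead column as input are illustrative assumptions.

```python
# Wavelengths (X.U.) and energy percentages for 0.4 cm. lead filtration,
# as read from the tables of the spectrum.
WAVELENGTHS = [4.15, 4.64, 5.10, 5.57, 5.91, 6.94, 8.89, 9.89,
               10.92, 13.1, 16.0, 20.2, 34.9, 41.6, 50.8]
ENERGIES = [1.2, 3.1, 2.7, 6.1, 5.3, 24.0, 17.7, 5.9,
            6.4, 6.5, 3.9, 12.6, 1.9, 1.7, 1.0]


def effective_wavelength(wavelengths, energies, exponent):
    """Power-mean wavelength: (sum(lam**n * dE) / sum(dE)) ** (1/n)."""
    total = sum(energies)
    moment = sum(lam ** exponent * e for lam, e in zip(wavelengths, energies))
    return (moment / total) ** (1.0 / exponent)


simple_mean = effective_wavelength(WAVELENGTHS, ENERGIES, 1.0)
scattering = effective_wavelength(WAVELENGTHS, ENERGIES, 0.5)
print(f"energy-weighted mean : {simple_mean:.2f} X.U.")
print(f"scattering weighting : {scattering:.2f} X.U.")
```

Because the exponent for scattering is less than one, the scattering value always lies below the simple energy-weighted mean for a heterogeneous beam.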


Table 5. Spectral distribution of filtered gamma-rays 

filter (cm. lead)       0.0    0.2    0.4    1.0    1.4    2.0    3.0    4.0    5.0 

λ (X.U.)   μ (lead)     ΔE₀    ΔE₂    ΔE₄    ΔE₁₀   ΔE₁₄   ΔE₂₀   ΔE₃₀   ΔE₄₀   ΔE₅₀ 

  4.15      0.456       1.4    1.3    1.2    0.9    0.76   0.58   0.37   0.23   0.15 
  4.64      0.465       3.7    3.4    3.1    2.3    1.95   1.47   0.93   0.58   0.29 
  5.10      0.483       3.9    3.0    2.7    2.0    1.67   1.25   0.77   0.47   0.29 
  5.57      0.490       7.4    6.7    6.1    4.6    3.74   2.79   1.71   1.05   0.64 
  5.91      0.502       6.5    5.9    5.3    3.9    3.21   2.37   1.44   0.87   0.53 
  6.94      0.534      30.3   27.3   24.0   17.8   14.10  10.40   6.11   3.58   2.10 
  8.89      0.619      22.7   20.1   17.7   12.2    9.50   6.59   3.55   1.92   1.03 
  9.89      0.670       7.7    6.7    5.9    4.0    3.02   2.02   1.04   0.53   0.27 
 10.92      0.724       8.6    7.4    6.4    4.2    3.10   2.01   0.98   0.47   0.23 
 13.1       0.832       9.1    7.6    6.5    3.9    2.83   1.72   0.75   0.32   0.14 
 16.0       1.060       6.0    4.8    3.9    2.1    1.35   0.72   0.25   0.09   0.03 
 20.2       1.384      21.9   16.7   12.6    5.5    3.16   1.38   0.35   0.09   0.02 
 34.9       3.31        7.1    4.1    1.9    0.26   0.07    -      -      -      - 
 41.6       4.52       10.4    4.2    1.7    0.11   0.02    -      -      -      - 
 50.8       6.60       13.9    3.7    1.0    0.02    -      -      -      -      - 

 ΣΔE                  160    123    100 %   63.8   48.5   33.3   18.3   10.2    5.7 


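For photoelectric absorption, the Hulme formula indicates that the coefficient τ 
is roughly proportional to λ^{2.1} over these wave-lengths. Thus in a material in which 
the absorption was purely photoelectric 

$$ {}_p\lambda_{\mathrm{eff}} = \left(\frac{\sum \lambda^{2.1}\,\Delta E}{\sum \Delta E}\right)^{1/2.1}. $$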


These effective wave-lengths are plotted in curve B of figure 4. They are 
considerably higher than those for scattering, particularly at the lower filtrations. It is 
clear that the effective wave-lengths for absorption in lead lie between curves A 
and B. A weighted mean of the two is required. The method adopted is, for each 
filtration, to weight the effective wave-length for photoelectric absorption by the 
proportion of the total energy which is absorbed in this way, and similarly with the 
wave-length for scattering. Thus 

$$ \lambda_{\mathrm{eff}} = \frac{{}_p\lambda_{\mathrm{eff}} \sum (\tau/\mu)\,\Delta E + {}_s\lambda_{\mathrm{eff}} \sum (\sigma/\mu)\,\Delta E}{\sum \Delta E}, $$

where $_p\lambda_{\mathrm{eff}}$ is the effective λ for photoelectric absorption, and $_s\lambda_{\mathrm{eff}}$ is the effective λ 
for absorption by scattering. Nuclear absorption has been neglected. The 
wave-lengths obtained in this way are shown in curve C of figure 4, and these have been 
adopted as the most probable values for the radiations used in the experiments. 
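They do not, in fact, differ greatly from the simple mean values mentioned at the 
beginning of this section. Identical calculations, using the gamma-ray spectrum of 
Ellis & Aston, lead to effective wave-lengths which are greater by about 1 x.u. for 
the same filtration over most of the range. 

Figure 4. Effective wave-length of filtered gamma-rays from radium (B + C) (Skobelzyn 
spectrum). Curve A, scattering absorption, 
$_s\lambda_{\mathrm{eff}} = \left(\sum \lambda^{1/2}\Delta E / \sum \Delta E\right)^{2}$. 
Curve B, photoelectric absorption, 
$_p\lambda_{\mathrm{eff}} = \left(\sum \lambda^{2.1}\Delta E / \sum \Delta E\right)^{1/2.1}$. 
Curve C, weighted mean for absorption in lead, 
$\lambda_{\mathrm{eff}} = \left({}_p\lambda_{\mathrm{eff}} \sum (\tau/\mu)\Delta E + {}_s\lambda_{\mathrm{eff}} \sum (\sigma/\mu)\Delta E\right) / \sum \Delta E$. 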

In connexion with the above calculation, it is of interest to observe the variation 
of the relative proportions of photoelectric and scattering absorption in lead as the 
primary radiation is more heavily filtered. This is shown in table 6. 


Table 6. Photo- and scattering absorption in lead 

filter (cm. lead)           0.0    0.2    0.4    1.0    1.4    2.0    3.0    4.0    5.0 

% photo-absorption         35.4   29.2   25.8   21.8   20.7   19.5   18.3   17.5   16.9 
% scattering absorption    64.6   70.8   74.2   78.2   79.3   80.5   81.7   82.5   83.1 


Comparison of theory and experiment 

The filtration scale of figure 3a can now be converted into a scale of wave-lengths, 
estimated from the Skobelzyn spectrum, with the results shown in figure 5a, curve B. 
The experimental values for the total mass absorption coefficients in lead are plotted 
for comparison with those calculated on theoretical grounds in a previous section. 


Figure 5. a, Mass absorption coefficients of gamma-rays in lead. b, Mass photoelectric 
absorption coefficients of gamma-rays in lead. Curves A, calculated from theory. Curves B, 
experimental (wave-lengths from Skobelzyn spectrum). Curves C, experimental (wave-lengths 
from Ellis & Aston spectrum). Abscissa: effective wave-length (x.u.). 







Although there is fair agreement in the absolute values of μ/ρ at short wave-lengths, 
the experimental coefficients tend to rise more rapidly with increasing wave-length 
than the theoretical values. The Klein-Nishina values for the scattering coefficient 
σ/ρ have been well established experimentally, and nuclear absorption is negligible 
above 10 x.u. The photoelectric coefficients have therefore been separated from the 
rest, and plotted in figure 5b. Here the theoretical values are from Hulme et al. 
(1935), and the experimental figures have been obtained by subtracting the 
calculated (σ/ρ + κ/ρ) from the measured total coefficient μ/ρ. As would be expected, 
the same tendency is apparent, the experimental values of τ/ρ rising more rapidly 
with increasing wave-length than the predictions of theory. It is of interest to 
compare the variation of photoelectric absorption for gamma-rays with that 
observed in the X-ray region. An analysis of the experimental results shows that 
they are represented approximately by the formula 

$$ \frac{\tau}{\rho} = \mathrm{const.} \times \lambda^{2.8}. $$

The exponent of λ in this formula is in better agreement with the value 2.92 normally 
found for X-rays than with the more generally accepted 2.2 for gamma-rays, but 
the proportionality constant is considerably larger. The value 2.8 for the exponent, 
however, should be accepted with some reserve, as the range of wave-lengths 
observed in the present experiments is very limited. 

There is some evidence that the Skobelzyn results on the gamma-ray spectrum 
tend to overestimate the energy content of the shorter wave-length components. 
This would lead to effective wave-lengths for the heterogeneous beam which are 
too small. It is unlikely that this can fully account for the difference between 
theory and experiment shown in figure 5. Although, as pointed out earlier, the 
method is not entirely satisfactory, the effective wave-lengths were also calculated 
using the spectral distribution obtained by the method of Ellis & Aston (1930), 
which is insensitive to the shorter wave-lengths. Using these figures (figure 5, 
curves C), the absolute discrepancy between theory and experiment is somewhat 
less, though still large. The variation of τ/ρ with wave-length is almost identical 
whichever spectrum is used, the exponent of λ in the equation above again being 
about 2.8. 

I wish to thank Miss D. R. Tavener for her assistance in the somewhat tedious 
computations involved. Thanks are also due to the Medical Research Council for 
the loan of part of the radium used in the experiments. 

References 

Ahmad 1924a Proc. Roy. Soc. A, 105, 507. 
Ahmad (& Stoner) 1924b Proc. Roy. Soc. A, 106, 8. 
Ahmad 1925 Proc. Roy. Soc. A, 109, 207. 
Bethe & Heitler 1934 Proc. Roy. Soc. A, 146, 83. 
Ellis & Aston 1930 Proc. Roy. Soc. A, 129, 180. 
Glasser 1928 Strahlentherapie, 29, 549. 
Gray, J. A. 1927 Trans. Roy. Soc. Can. 21, 163. 
Gray, L. H. 1931 Proc. Camb. Phil. Soc. 27, 103. 
Gray, L. H. 1936 Proc. Roy. Soc. A, 156, 578. 
Hulme, McDougall, Buckingham & Fowler 1935 Proc. Roy. Soc. A, 149, 131. 
Klein & Nishina 1928 Z. Phys. 52, 853. 
Kohlrausch 1917 S.B. Akad. Wiss. Wien, 126, 441. 
Mayneord 1933 Brit. J. Radiol. 6, 598. 
Mayneord & Roberts 1937 Brit. J. Radiol. 10, 365. 
Read & Lauritsen 1936 Phys. Rev. 45, 433. 
Rutherford, Chadwick & Ellis 1930 Radiations from Radioactive Substances, pp. 466-80. 
    Camb. Univ. Press. 
Sievert 1932 Acta Radiol., Stockh., Suppl. 14. 
Sizoo & Williams 1938 Physica, Eindhoven, 5, 100. 
Skobelzyn 1927 Z. Phys. 43, 354. 
Skobelzyn 1929 Z. Phys. 58, 595. 
Stoner 1929 Phil. Mag. 7, 841. 
Tandberg 1937 Uppsala Dissertation. 
Tarrant 1932 Proc. Camb. Phil. Soc. 28, 475. 




The normal vibrations of bridged X₂Y₆ molecules 
By R. P. Bell and H. C. Longuet-Higgins 
(Communicated by E. J. Bowen, F.R.S. Received 7 February 1944) 

The molecular model considered is the bridged structure Y₂XY₂XY₂, in which two Y atoms 
link the two X atoms, with a tetrahedral distribution of valencies round the atom X. 
This structure applies to the dimeric trihalides of aluminium, 
indium and gallium, and to the hydrides B₂H₆ and Ga₂H₆. Symmetry classes and selection 
rules are given, followed by equations for the normal vibration frequencies derived from a 
valency-force treatment, taking into account the interactions between unlinked atoms in 
the ring. A special treatment is necessary for the bending vibration in which the potential 
energy is proportional to the fourth power of the displacement. 

The observed infra-red and Raman spectra of diborane can be satisfactorily assigned on the 
basis of this model, fifteen out of the seventeen active fundamentals being observed. These 
fifteen frequencies are predicted with an average error of 2 % and a maximum error of 6 % 
by the insertion of six adjustable force constants in the theoretical equations. The observed 
and calculated vibration frequencies are used to calculate the specific heat of diborane in the 
range 100-300° K. The calculated values agree with experiment, except for a small 
discrepancy at the highest temperature which can be reasonably accounted for. 

The above results provide strong evidence that diborane has a bridge configuration rather 
than one resembling ethane. The force-constants obtained show that the terminal B-H links 
are normal single bonds, while the bonds forming the bridge are considerably weaker. This 
is in harmony with the greater length of the bridge links, and the small number of electrons 
available for their formation. It is concluded that the hypothetical reaction 2BH₃ → B₂H₆ 
involves the absorption of at least 15 kcal./mol. 

A simplified form of the frequency equations is used to calculate the Raman frequencies 
of the dimeric halides of aluminium. The results are in semi-quantitative agreement with 
the observed frequencies, and lead to an assignment which is consistent with the observed 
intensities. 


Introduction 

The only vibrational analysis so far carried out for molecules of formula X₂Y₆ 
relates to the ethane structure Y₃X.XY₃, with or without free rotation of the XY₃ 
groups. This model will apply to ethane itself, to Si₂H₆, and to the hexa-halogen 
derivatives of these compounds. There is, however, an alternative structure for a 
molecule X₂Y₆, in which two of the Y atoms form a bridge between the X atoms, i.e. 
Y₂XY₂XY₂. There is direct evidence that this structure is the correct one for 
the dimeric trihalides of aluminium, indium and gallium (Palmer & Elliott 1938; 
Brode 1940), and it probably also applies to other dimeric halides such as Fe₂Cl₆ 
and Au₂Cl₆. Moreover, it has been recently shown (Longuet-Higgins & Bell 1943) 
that diborane, B₂H₆, and the recently discovered Ga₂H₆ (Wiberg & Johannsen 1942) 
probably also have a bridged structure. Experimental data are available for the 
Raman spectra of the aluminium halides, and for both Raman and infra-red spectra 
of diborane. It therefore seems worth while to carry out a quantitative treatment of 
the vibrations of the bridged molecule. 




Vol. 183. A. (18 June 1944) 




Symmetry properties 

In most of the molecules mentioned above, the valencies of the X atoms are 
probably tetrahedrally disposed, so that the molecule X₂Y₆ is non-planar. In a few 
cases (e.g. Au₂Cl₆) the valencies, and hence the whole molecule, are probably planar. 
In either case the molecule belongs to the symmetry group D_2h, like ethylene. Since 
the ethane structure has symmetry D_3h or D_3d, the vibrational spectrum should 
provide a useful criterion for deciding between the ethane and the bridge structures. 

In the present paper we shall deal only with the non-planar model. The 
symmetry characteristics of the molecule are then conveniently taken as three planes 
at right angles: σ_x through the two central Y atoms and perpendicular to the line 
joining the X atoms, σ_y containing the two X atoms and the four terminal Y atoms, 
and σ_z containing the two X atoms and the two central Y atoms. The symmetry 
classes, selection rules and number of vibrations in each class are given in table 1, 
the symbols used being those of Placzek (1934). It will be seen that there are no 
degenerate frequencies, and that no vibrations are active both in the Raman and 
in the infra-red spectrum. In both these points there is a contrast with the ethane 
model of symmetry D_3h or D_3d. 

Figure 1 illustrates approximately the character of the vibrations listed in table 1, 
assuming that the bending force constants are considerably smaller than the 
stretching ones. The ring is perpendicular to the plane of the paper, and the signs 
+ and − represent motion out of and into this plane respectively. No attempt has 
been made to show all the motions of any given vibration. 

Table 1 

          symmetry with respect to 
class       σ_x     σ_y     σ_z       activity             vibrations 

A_1g         s       s       s        Raman (p)            ν₁ ν₂ ν₃ ν₄ 
A_1u         as      as      as       inactive             ν₅ 
B_1g         as      as      s        Raman (dp)           ν₆ ν₇ 
B_1u         s       s       as       infra-red (M_z)      ν₈ ν₉ ν₁₀ 
B_2g         as      s       as       Raman (dp)           ν₁₁ ν₁₂ 
B_2u         s       as      s        infra-red (M_y)      ν₁₃ ν₁₄ 
B_3g         s       as      as       Raman (dp)           ν₁₅ 
B_3u         as      s       s        infra-red (M_x)      ν₁₆ ν₁₇ ν₁₈ 
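
The bookkeeping behind table 1 can be checked with a short tally. This is a sketch only: the class labels and vibration counts are those read from table 1, and the dictionary layout is an assumption made for the example.

```python
# Symmetry class -> (activity, number of vibrations), as read from table 1.
CLASSES = {
    "A1g": ("Raman", 4),
    "A1u": ("inactive", 1),
    "B1g": ("Raman", 2),
    "B1u": ("infra-red", 3),
    "B2g": ("Raman", 2),
    "B2u": ("infra-red", 2),
    "B3g": ("Raman", 1),
    "B3u": ("infra-red", 3),
}

N_ATOMS = 8  # X2Y6


def count(activity):
    """Number of vibrations with the given activity."""
    return sum(n for act, n in CLASSES.values() if act == activity)


total = sum(n for _, n in CLASSES.values())
raman = count("Raman")
infrared = count("infra-red")
inactive = count("inactive")

print("total vibrations :", total, "(3N - 6 =", 3 * N_ATOMS - 6, ")")
print("Raman active     :", raman)
print("infra-red active :", infrared)
print("inactive         :", inactive)
```

Since every class is active in at most one spectrum, the Raman and infra-red counts add up to the number of active fundamentals without double counting.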


Valency-force treatment 

The nomenclature used for the dimensions of the molecule at rest is shown in 
figure 2. M and m represent the masses of X and Y respectively, and it is convenient 
to distinguish the central Y atoms by the symbol Y′. The expression assumed for 
the potential energy is 

$$
\begin{aligned}
V ={}& 4\cdot\tfrac12 f_1(\Delta XY')^2 + 4\cdot\tfrac12 f_2(\Delta XY)^2 \\
&+ 2\cdot\tfrac12 d_1 a^2(\Delta Y'XY')^2 + 2\cdot\tfrac12 d_2 b^2(\Delta YXY)^2 + 2\cdot\tfrac12 d_3 a^2(\Delta XY'X)^2 \\
&+ 8\cdot\tfrac12 d_4 ab(\Delta YXY')^2 + \tfrac12 g_1(\Delta XX)^2 + \tfrac12 g_2(\Delta Y'Y')^2, \qquad (1)
\end{aligned}
$$

which will apply to any of the normal vibrations listed in table 1. Of the force 
constants in equation (1), f₁ and f₂ refer to bond-stretching, there being two different 
constants because the linkage between X and Y in the ring differs in nature from the 
terminal linkages. Similarly, there are three different bending force constants 
(d₁, d₂ and d₄) for valencies attached to X, and one (d₃) for valencies attached to Y′. 



Figure 1 

These six constants are all that is required in a simple valency-force treatment for 
the bonds shown in figure 2, but we have added the terms in g₁ and g₂, corresponding 
to interactions between the pairs of opposite atoms in the ring. Although the 
ordinary valency picture represents these atoms as unlinked, they are likely to be 
much closer together than is common for unlinked atoms, and their interactions 
may be considerable. These terms only enter into vibrations of the class A_1g, since 
in all other classes the distances XX and Y′Y′ remain unchanged. 

Figure 2 

In deriving the equations for the normal vibration frequencies we have used two 
sets of co-ordinates: (a) the Cartesian co-ordinates of the atoms, and (b) the bond 
lengths and angles. In the absence of degeneracy either set are satisfactory symmetry 
co-ordinates, and the secular equation factorizes so that each symmetry class can 
be treated separately. The equation for each class has been derived independently 
in both sets of co-ordinates, agreement between the two methods constituting a 
valuable check on the rather involved algebra. 

It is convenient to introduce the following abbreviations: 
c₁ = cos θ₁, c₂ = cos θ₂, s₁ = sin θ₁, s₂ = sin θ₂, 

i _ i 1 1 _ JL 2 1 _ 1 ^ 3 1 _ c\ sf 

m 1 ~ m M’ m 2 m M’ m s in"*" M’ m 4 — m M’ 

1 _ 5 i c i 1 _ 1 2 c§ 1 _ 1 2 s| 1 _ 4 c x c 2 

m s ~ vn M' m 6 ~ m + M\ m 1 ~ m + M’ m & ~ M ’ 

fi =47rV, d 5 = 2d 4 /(l—c|c|), D = s\f x +2c\d z , E = c{f x +2s\< 

X-i = /i++ s|g 2 , X 3 = 2 d x + 2 d z -\ —®ic|d5 + s|gf 4 +c5 

CL 

= 2 d 2 +-^c\sld 5 , K 5 = s 1 c 1 s 2 C 2 d 5 , K$ = (^i— 





The equations for the frequencies of the normal vibrations are then as follows: 


m | 

+ 2 K.K. 


. T2K, K 2 2K 3 K, 1 1 1 \~| 

1- /t 4 —/t 3 —-4—- + — 3 + — — 4Z 5 -|^ — dZgSjCjl--jrp 

la r L to 5 m 6 m i m i - 3 * \m J2/J 

(|4) + ^ t +2 ^^-^(|4) 

-iK I (2i 1 +i: 1 )^- ( ur 1 Xrtc l (i--^)-4X t jc,» 1 o 1 (i 
+ar s jr, - i [iS (X A - j®+ t|s (£, £. - J£» 

+2r,(Z»X t -Z:)(^+ : i)+2iAX 1 (A+g)-8X 1 Z a i s ^ 
*rr W 17 SlCl i °TT V TT 

— 4 it 2 A 4 A 6 -^ + 8A a ii 6 A 6 -^r ^ | 


2c| 

M 


) 


4 Z. 


^x« V = 


+^ [if, Z 3 if 4 - K, IP, -K.KI] = 0, 

i.® “ ’ 

mb 




4H>ia + ?£ + i) + y- # -f 

2J lu /* = 0 or ii 2 —iX— J rs\di < 2.~ — +t—+ —]1 
lw r r L 5 ' w 7 l am i TO sJJ 

4^ 2 ^ + ®K54,]- 0 ’ 

' ,1 -'“[^ +,i ‘ s5 {5B + 5#„ + i)] + S A ‘ i «' i [5S + sS + SL!" *• 


a m x bm 2 m 8 ) J mm 3 b 


d. 




J su 




m a 


3 n£ 2E + 4-^2 4-/7 fo ^ 5 1 C 2 , a £lf|_ flffll ./f P 2 jE^/ f 

^- /t L^ + ^ + ^' +d5 («^ + ^'^rlJ + ™L tv» 

+2d2 (i + i)} + 2d { 2 1 s ^ di+ h* s * [m 7 + m) 

. 2 si+, d Li«JJL + ±\ + &*- 4 $\ 

m 2 ** 1 5 { a 1 2 \m s M] b m % m 8 jj 


+ - 

-^J_ 2 d ^ 44 +d 5 JS~clsl + 2 d 2 jf^ - 0. 





Kohlrausch & Wagner (1942) have recently given expressions for the Raman 
frequencies of a simplified type of bridged molecule, and a later paper (Wagner 
1943) gives corrections to these expressions and extends the treatment to the 
infra-red frequencies. They consider a molecule in which all the bond lengths are equal 
and the valencies of the atom X are directed towards the vertices of a regular 
tetrahedron. Their expressions should therefore be obtainable from ours (allowing 
for differences of notation) by putting a = b, 2θ₁ = 2θ₂ = 109°. In most cases there 
is agreement, but we believe their expressions for B_2g and B_3u to be in error. The 
correct equation for B_2g is (in their notation) 


B29 ^ 




In the case of B_3u their equation involves a fundamental error, since it does not 
contain the bending force constant for the angle YXY (d₂ in both notations). 
Kohlrausch & Wagner have not given explicitly the equation for class A_1g, but we agree 
with their (corrected) expression for the secular determinant. 


The fourth-power vibration 

In the above treatment the frequency ν₁₀ of class B_1u comes out as zero, since for 
a first-order infinitesimal displacement the angles and the distances undergo only 
second-order changes. It has been shown elsewhere (Bell 1944) that in this type of 
ring-bending the potential energy of the system is proportional to the fourth power 
of the displacement, and the theory of such 'fourth-power vibrations' has been 
considered. For a single particle of mass m having a potential energy given by V(x) = Ax⁴ 
it was shown that the lowest spectrum frequency ν₀ is given by 

$$ \nu_0 = 0.163\,(hA/m^2)^{1/3}. \qquad (4) $$

This equation can be used to calculate the frequency ν₁₀ in the molecule X₂Y₆, 
provided that some simplifying assumptions are made. In the first place, it is 
necessary to treat the fourth-power vibration as a separate problem, divorced from 
the other vibrations of the molecule. Secondly, the terminal XY₂ groups are 
considered to move as a whole without distortion. There is no strict justification for 
either of these simplifications, but they are both physically reasonable provided that 
the fourth-power frequency is considerably lower than the other frequencies of the 
system: this is likely to be so in practice. If the XY₂ groups undergo an arbitrary 
small displacement z perpendicular to the plane of the ring, then the vibration of the 
ring can be described by the following scheme of displacements: 

              δx                  δy                  δz 

  XY₂     ∓ (λ₁/a) z²             0                   + z 
  Y′          0              ∓ (λ₂q²/a) z²            − qz            (5) 




where q = (M + 2m)/m, and λ₁ and λ₂ are parameters to be determined subsequently. 
The changes in distances and angles for small z can then be obtained from the 
ordinary geometrical expressions by expanding in powers of z² and rejecting higher 
powers. The results are: 

$$
\begin{aligned}
\Delta XY' &= \frac{z^2}{2a}\{(1+q)^2 - 2c_1\lambda_1 - 2q^2 s_1\lambda_2\},\\
\Delta Y'XY' &= -\frac{z^2}{a^2 c_1}\{(1+q)^2 s_1 - 2c_1 s_1\lambda_1 + 2q^2 c_1^2\lambda_2\},\\
\Delta XY'X &= -\frac{z^2}{a^2 s_1}\{(1+q)^2 c_1 + 2s_1^2\lambda_1 - 2q^2 c_1 s_1\lambda_2\},\\
\Delta YXY' &= 0, \qquad \Delta XX = -\frac{2\lambda_1}{a}z^2, \qquad \Delta Y'Y' = -\frac{2q^2\lambda_2}{a}z^2. \qquad (6)
\end{aligned}
$$


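In terms of the force constants already defined, the potential energy is then 
given by 

$$
\begin{aligned}
V ={}& 4\cdot\tfrac12 f_1(\Delta XY')^2 + 2\cdot\tfrac12 d_1 a^2(\Delta Y'XY')^2 + 2\cdot\tfrac12 d_3 a^2(\Delta XY'X)^2 \\
&+ 8\cdot\tfrac12 d_4 ab(\Delta YXY')^2 + \tfrac12 g_1(\Delta XX)^2 + \tfrac12 g_2(\Delta Y'Y')^2 \\
={}& \frac{z^4}{2a^2}\Big[ f_1\{(1+q)^2 - 2c_1\lambda_1 - 2q^2 s_1\lambda_2\}^2 \\
&+ \frac{2d_1}{c_1^2}\{(1+q)^2 s_1 - 2c_1 s_1\lambda_1 + 2q^2 c_1^2\lambda_2\}^2 \\
&+ \frac{2d_3}{s_1^2}\{(1+q)^2 c_1 + 2s_1^2\lambda_1 - 2q^2 c_1 s_1\lambda_2\}^2 + 4g_1\lambda_1^2 + 4g_2 q^4\lambda_2^2 \Big] \\
={}& A z^4, \qquad (7)
\end{aligned}
$$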


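whence λ₁ and λ₂ are determined by minimizing V, i.e. putting 

$$ \frac{\partial V}{\partial \lambda_1} = \frac{\partial V}{\partial \lambda_2} = 0. \qquad (8) $$

The kinetic energy is to the same approximation 

$$ T = 2\cdot\tfrac12 m(q\dot z)^2 + 2\cdot\tfrac12 qm\dot z^2 = \tfrac12\cdot 2mq(1+q)\dot z^2. $$

Comparing this with equation (4) for a single particle, then 

$$ \nu_{10} = 0.163\left[\frac{hA}{4m^2q^2(1+q)^2}\right]^{1/3}, \qquad (9) $$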

where A is defined by (7) in terms of known quantities. It should be noted that ν₁₀ 
no longer belongs strictly to the symmetry class B_1u and its activity is thus not 
restricted to the infra-red spectrum. However, its Raman activity is likely to be 
weak, since the relevant changes in distances and angles are of the second order, 
compared with the first-order change in electric moment. 
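
A small numerical sketch may help to show how equations (4) and (9) fit together: the ring-bending frequency is the single-particle result evaluated with the effective mass 2mq(1 + q) taken from the kinetic energy. The function names, the use of SI units and the sample value of A are assumptions made purely for illustration; no value of A is given in the text at this point.

```python
H_PLANCK = 6.62607015e-34      # Planck constant, J s
ATOMIC_MASS = 1.66053907e-27   # kg per atomic mass unit


def quartic_frequency(a_coeff, mass):
    """Equation (4): nu_0 = 0.163 * (h * A / m**2) ** (1/3).

    a_coeff is A in V = A x**4 (J m^-4), mass is in kg; the result is in Hz.
    """
    return 0.163 * (H_PLANCK * a_coeff / mass ** 2) ** (1.0 / 3.0)


def ring_bending_frequency(a_coeff, m_y, m_x):
    """Equation (9), written as equation (4) with effective mass 2*m*q*(1+q)."""
    q = (m_x + 2.0 * m_y) / m_y
    effective_mass = 2.0 * m_y * q * (1.0 + q)
    return quartic_frequency(a_coeff, effective_mass)


# Hypothetical quartic constant, chosen only to exercise the functions.
A_EXAMPLE = 1.0e22                      # J m^-4 (assumed)
m_h = 1.008 * ATOMIC_MASS               # Y = H
m_b = 10.81 * ATOMIC_MASS               # X = B
nu = ring_bending_frequency(A_EXAMPLE, m_h, m_b)
print(f"nu_10 = {nu:.3e} Hz for the assumed A")
```

Because the frequency varies as the cube root of A, an eightfold change in the quartic constant only doubles the predicted frequency.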



Application to the spectrum of diborane 

Experimental data. The Raman spectrum of liquid diborane has been studied by 
Anderson & Burg (1938), and the infra-red spectrum of the gas by Stitt (1941). The 
results obtained are given in the first three columns of table 2. (Two of the Raman 
shifts originally recorded were later attributed to mercury lines.) In the infra-red 
spectrum the low resolving power used makes it difficult to determine the band 
contours with certainty. However, three of the bands appear to be of the parallel 
type, and for these the positions of the three branches are given in the table. In 
other cases only the centres of the bands are recorded. Bands in the range 
3000-6000 cm.⁻¹ have not been included in the table, since their positions are not 
known with sufficient accuracy to permit of any certain assignment. They will 
all be combination frequencies, and all of them can be accounted for in several 
different ways. 

Table 2. Infra-red and Raman spectra of diborane 

Infra-red spectrum 

  ν (obs.) cm.⁻¹             uncertainty   intensity      assignment 

  393, 412, 431  ‖ (?)            3        strong         fundamental ν₁₀ 
  679                             3        weak           1863 − 1180 = 683 
  831                             3        weak           1608 − 793 = 815 
  981                             2        strong         fundamental ν₁₄ 
  1154, 1178, 1197  ‖             2        very strong    fundamental ν₁₈ 
  1292                            2        medium         2489 − 1178 = 1311 
  1377                            3        medium         2558 − 1180 = 1378 
  1405                            5        medium         fundamental ν₉ 
  1587, 1608, 1626  ‖             3        very strong    fundamental ν₁₇ 
  1722                           10        weak           ? 
  1863                            4        strong         fundamental ν₁₃ 
  1993                            6        medium         1008 + 981 = 1989 
                                                          1178 + 820 = 1998 
                                                          1178 + 793 = 1971 
  2134                           10        very weak      1180 + 981 = 2161 
  2217                           10        very weak      1405 + 820 = 2225 
                                                          1405 + 793 = 2198 
  2353                            8        strong         1180 + 1178 = 2358 
  2558                           10        strong         fundamental ν₁₆ 
  2625                           10        strong         fundamental ν₈ 

Raman spectrum 

  ν (obs.) cm.⁻¹     intensity      assignment 

  793                5              fundamental ν₄ (B¹¹-B¹¹) 
  806                4              fundamental ν₄ (B¹¹-B¹⁰) 
  821                3 (broad)      fundamental ν₄ (B¹⁰-B¹⁰), ν₁₂ 
  1008               diffuse        fundamental ν₇ 
  1180               2              fundamental ν₃ 
  2102               10             fundamental ν₂ 
  2489               3              fundamental ν₁₁ 
  2523               10             fundamental ν₁ 
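
The combination assignments in table 2 are simple sums and differences of fundamentals, and the arithmetic can be checked mechanically. The sketch below takes the first assignment listed for each combination band, with the stated uncertainty; the tuple layout and the function name `predicted` are assumptions made for the example.

```python
# (observed band, stated uncertainty, first term, sign, second term), all in cm^-1,
# as read from the infra-red part of table 2.
COMBINATIONS = [
    (679, 3, 1863, -1, 1180),
    (831, 3, 1608, -1, 793),
    (1292, 2, 2489, -1, 1178),
    (1377, 3, 2558, -1, 1180),
    (1993, 6, 1008, +1, 981),
    (2134, 10, 1180, +1, 981),
    (2217, 10, 1405, +1, 820),
    (2353, 8, 1180, +1, 1178),
]


def predicted(first, sign, second):
    """Sum (sign = +1) or difference (sign = -1) of two fundamentals."""
    return first + sign * second


for obs, unc, first, sign, second in COMBINATIONS:
    calc = predicted(first, sign, second)
    within = abs(calc - obs) <= unc
    print(f"obs {obs:5d}  calc {calc:5d}  diff {calc - obs:+4d}  "
          f"within stated uncertainty: {within}")
```

A difference larger than the stated uncertainty does not by itself rule out an assignment, since combination tones are not expected to be exact sums of the fundamentals.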

Frequency assignments for the ethane model. Attempts to analyse the vibrational 
spectrum on this basis (Stitt 1941; Bauer 1942) have not been very successful. The 
observed spectrum is much more complex than that of ethane, and in particular 
there are too many strong lines observed in the region corresponding to B-H 
bond-stretching (2000-3000 cm.⁻¹). This difficulty appears in a simple form in the Raman 
spectrum, where the two most intense frequencies, 2102 and 2523, must both be 
attributed to totally symmetrical fundamentals, though the ethane model predicts 
only one stretching frequency of this type. Moreover, the infra-red spectrum contains 
eight bands which appear to be strong enough for fundamentals, compared with 
five predicted by the ethane model, and of these the one at 412 cm.⁻¹ is much too 
low to be readily accounted for. In order to meet these difficulties Stitt (1941) has 
postulated the existence of one or more low-lying electronic levels in diborane, and 
has given two alternative frequency assignments on this basis. However, these 
assignments still involve difficulties, and there is no independent evidence for the 
existence of low-lying electronic levels in diborane or in similar molecules. Anderson 
& Burg (1938) have suggested that the two Raman lines 2102 and 2523 may represent 
a resonance splitting, but it is most unlikely that a separation as great as 421 cm.⁻¹ 
could arise from this cause. 

Frequency assignments for the bridge model. As previously mentioned 
(Longuet-Higgins & Bell 1943) these difficulties disappear if the bridge formula is adopted for 
diborane. The absence of degeneracy allows eight distinct frequencies to appear in 
the infra-red (see table 1), and the low frequency 412 is naturally interpreted as the 
fourth-power bending vibration ν₁₀. Moreover, there are now two totally symmetrical 
types of bond-stretching (ν₁ and ν₂) to be allotted to the Raman frequencies 2102 
and 2523. 

Most of the fundamental frequencies can be allotted with little ambiguity, and 
some information is given by the infra-red band contours. The bridge structure for 
B₂H₆ is approximately a symmetrical top, the moments of inertia about the y and z 
axes differing by less than 10 %, and it may therefore be expected that the bands 
can be classified as parallel (B_3u) and perpendicular (B_1u and B_2u). The expression 
given by Gerhard & Dennison (1933) predicts a separation of about 40 cm.⁻¹ between 
the P and R branches, which agrees approximately with the bands marked as 
parallel in table 2. The apparently parallel character of the band at 412 cm.⁻¹ is at 
variance with its allocation as ν₁₀ of B_1u, but this region of the spectrum is very near 
the limit of transparency of the prism used, and the contours are uncertain: 
moreover, ν₁₀ is not an ordinary harmonic vibration, and we shall see later that there 
are other possible reasons for the structure of this band. 




The assignments are most readily made by dividing the normal vibrations into 
the following categories. (The hydrogen atoms in the ring will be distinguished as H′.) 

(a) Stretching of terminal B-H links (ν₁, ν₈, ν₁₁, ν₁₆). The vibrational frequency 
of the spectroscopic molecule BH is 2366 (King 1938), and the B-H stretching 
frequency in triborine triamine, B₃N₃H₆, is 2535 (Crawford & Edsall 1939). We should 
therefore expect these frequencies to lie in the range 2300-2700, with ν₁ ≈ ν₁₆ 
(cf. figure 1). From its intensity ν₁ = 2523, leaving ν₁₁ = 2489. In the infra-red 
ν₁₆ (≈ ν₁) = 2558, leaving 2353 or 2625 for ν₈. Of these two frequencies only the first 
can be satisfactorily interpreted as a combination tone, so that we have assumed 
ν₈ = 2625. ν₁₆ should be a parallel band, but this would probably escape detection 
owing to the low resolving power. 

(b) Ring-stretching (ν₂, ν₆, ν₁₃, ν₁₇). These frequencies will be somewhat lower than 
those of type (a), since the linkage B-H′ will be weaker than the terminal B-H link. 
The intense Raman line 2102 is clearly the totally symmetrical ν₂, and 1608 (parallel) 
= ν₁₇. This leaves ν₁₃ = 1863 in the infra-red, while ν₆ is probably not observed. 

(c) B-B vibration (ν₄). This differs from type (b) in that the B-H′ links are only 
slightly stretched, while the forces between unlinked atoms in the ring come into 
play. No prediction of frequency can be made, but the vibration should appear 
strongly in the Raman spectrum and should show a marked separation for the 
isotopes B¹⁰ and B¹¹: hence it is identified as the triplet 793, 806, 821. The component 
at 821 is broad and is considerably more intense than would be expected from the 
relative abundance of the isotopes, and it is likely that there is a second Raman 
frequency at this point. 

(d) Angular motion of terminal B-H groups (ν₃, ν₅, ν₇, ν₉, ν₁₂, ν₁₄, ν₁₅, ν₁₈). By
analogy with other compounds of hydrogen these frequencies should lie in the
neighbourhood of 1000. They can be further classified as twisting (ν₅ and ν₁₅),
deformation (ν₃ and ν₁₈), rocking (ν₉ and ν₁₂) and bending (ν₇ and ν₁₄) (cf. Linnett
1938). The twisting frequency ν₅ is inactive, and ν₁₅ is likely to appear very weakly
if at all. ν₃ and ν₁₈ both involve changes in the angle HBH; ν₃ should be the most
intense Raman frequency in this region, and ν₁₈ a parallel band. Clearly ν₃ = 1180
and ν₁₈ = 1178. The remaining frequencies involve only changes in the angle HBH′,
and presumably include 821 and 1008 in the Raman spectrum, and 981 in the
infra-red. A closer assignment can only be made by appealing to the equations (3), which
show that ν₇ > ν₁₂ and ν₉ > ν₁₄ for any reasonable values of the force constants. This
suggests ν₁₂ = 821, ν₇ = 1008, the diffuse character of the latter being in accord
with the considerable isotope effect (ca. 20 cm.⁻¹) predicted by equation (3) B1g.
There are only weak infra-red bands between 412 and 981, so that ν₁₄ = 981. The
remaining frequency might be any one of the bands of medium intensity 1292,
1377 or 1405. In choosing ν₉ = 1405 we have been guided by the results of the
quantitative treatment in the next section.

The above assignments account for fifteen out of the seventeen active fundamentals,
and one of the two missing frequencies would be expected to have low
Raman activity. Moreover, fourteen of the fifteen assignments relate to observed





The normal vibrations of bridged X 2 Y 6 molecules 

frequencies described as 'strong', and there is only one such observed frequency
(2353) which cannot be interpreted as a fundamental. The high intensity of this
band may be related to the close coincidence between ν₃ + ν₁₈ = 2358, 2ν₃ = 2360,
2ν₁₈ = 2356. The combination tones assigned in table 2 involve only observed
fundamentals, and are allowed by the selection rules derived from table 1. An
alternative explanation of the weak infra-red frequencies 831 and 2134 is to identify
them with the fundamental Raman frequencies 821 and 2102, since in the
unsymmetrical molecule B¹⁰B¹¹H₆ these could appear weakly (and with slightly
enhanced frequency) in the infra-red spectrum. Further alternative assignments
for combination tones are considered at the end of the next section.

Application of normal co-ordinate treatment. The general equations (3) contain
eight force constants, but in the case of diborane it is reasonable to reduce this
number to six by putting d₃ = g₂ = 0. d₃ refers to the bending of the ring angle
BH′B, and the two links attached to hydrogen would not be expected to have
directional properties. Similarly, g₂ refers to the interaction between the two
unlinked H′ atoms in the ring, which is likely to be small compared with the interaction
between the boron atoms, represented by g₁. We have also assumed the value
f₂ = 3.42 × 10⁵ dynes/cm. derived by Crawford & Edsall (1939) for B-H stretching
in triborine triamine, since the terminal links in diborane should be normal single
bonds. The equations also contain the dimensions of the molecule, for which we have
taken the values derived from electron-diffraction data (Longuet-Higgins & Bell
1943), i.e. a = 1.39 Å, b = 1.18 Å, θ₁ = 50°, θ₂ = 60°.

There then remain five adjustable force constants to be derived from the observed
fundamental frequencies. Of these, the most difficult to fix is d₄ (or the related d₂),
which occurs in the equation for every symmetry class. We have chosen the value
d₄ = 0.302 × 10⁵ dynes/cm. as giving the best general representation of the observed
bending frequencies. The remaining four force constants were then derived by
assuming successively ν₁₃ = 1861, ν₁₇ = 1608, ν₁₈ = 1178, ν₄ = 793, when all the
remaining frequencies could be calculated.

The force constants obtained are given at the head of table 3, followed by the
calculated and observed frequencies. The former are given to the nearest 5 cm.⁻¹,
and those observed frequencies which were used in deriving the force constants are
in parentheses. The mean deviation between calculated and observed frequencies
is 2 %, and the maximum deviation 6 %. Still better agreement could be obtained
by abandoning the assumption d₃ = g₂ = 0: for example, if g₂ = 0.3 the discrepancy
of 6 % for ν₂ disappears, and ν₄ is increased by only 2 %. However, it does not seem
worth while to make any further adjustments, since the agreement shown in table 3
is quite as good as has been previously obtained for molecules of comparable
complexity, using a valency-force potential energy function without cross terms.
Moreover, the calculated bending frequencies are rather sensitive to the values taken
for the angles θ₁ and θ₂ and the ratio b/a, and it is unlikely that these quantities are
given to better than ± 5 % by the electron-diffraction data (cf. Diatkina & Sirkin
1942).
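The quoted mean and maximum deviations can be checked directly from the entries of table 3. The following is a minimal sketch only: the dictionary name and layout are our own, the four frequencies used to fix the force constants are included with zero deviation, and for ν₁₀ the third member of the calculated series (407) is compared with the observed band at 412.

```python
# (calculated, observed) fundamentals of diborane in cm^-1, copied from table 3.
# Assumption: nu10 is represented by the calculated 407 against the 412 band.
TABLE3 = {
    "nu1": (2495, 2523), "nu2": (1985, 2102), "nu3": (1180, 1180),
    "nu4": (792, 792), "nu7": (1050, 1008), "nu8": (2585, 2625),
    "nu9": (1450, 1405), "nu10": (407, 412), "nu11": (2580, 2489),
    "nu12": (800, 821), "nu13": (1861, 1861), "nu14": (920, 981),
    "nu16": (2475, 2558), "nu17": (1608, 1608), "nu18": (1178, 1178),
}


def percent_deviation(calc, obs):
    """Signed deviation of the calculated from the observed value, in per cent."""
    return 100.0 * (calc - obs) / obs


deviations = {name: percent_deviation(c, o) for name, (c, o) in TABLE3.items()}
mean_abs = sum(abs(d) for d in deviations.values()) / len(deviations)
worst = max(deviations, key=lambda name: abs(deviations[name]))

print(f"mean |deviation| = {mean_abs:.1f} %")
print(f"largest deviation: {worst} ({deviations[worst]:+.1f} %)")
```

On this reading the mean is taken over all fifteen assigned fundamentals; restricting it to the frequencies not used in the fitting would give a somewhat larger figure.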





Table 3. Force constants and frequencies for diborane

force constants (in 10⁵ dynes/cm.)

f₁ = 1.43, f₂ = 3.42, d₁ = 0.059, d₂ = 0.243, d₃ = 0, d₄ = 0.302, g₁ = 2.21, g₂ = 0.

frequencies (in cm.⁻¹)

class   mode   calc.            obs.         % diff.
A1g     ν₁     2495             2523         -1
        ν₂     1985             2102         -6
        ν₃     1180             1180          0
        ν₄     (792)            792           0
A1u     ν₅     890              (inactive)    —
B1g     ν₆     1540             —             —
        ν₇     1050             1008         +4
B1u     ν₈     2585             2625         -2
        ν₉     1450             1405         +3
        ν₁₀    266, 350, 407    412          -1
B2g     ν₁₁    2580             2489         +4
        ν₁₂    800              821          -2
B2u     ν₁₃    (1861)           1861          0
        ν₁₄    920              981          -6
B3g     ν₁₅    1505             —             —
B3u     ν₁₆    2475             2558         -2
        ν₁₇    (1608)           1608          0
        ν₁₈    (1178)           1178          0





Wagner (1943) has attempted a still greater simplification, putting a = b,
2θ₁ = 2θ₂ = 109°, f₁ = f₂ = 10d₁ = 10d₂ = 10d₄ = 2.5 × 10⁵, g₁ = g₂ = 0, but
his calculated frequencies do not agree even qualitatively with the observed spectrum
of diborane. This is chiefly because (as already mentioned) his equations for classes
B2g and B3u are incorrect, which leads him to identify the observed band at 412
with ν₁₈ and to assume that ν₁₀ is not observed.

The fourth-power frequency ν₁₀ calls for special comment. When the above values
of the force constants are inserted in equations (7) and (9) we find ν = 266 cm.⁻¹
for the lowest spectrum frequency, which is too low to be observed directly in the
infra-red. It is of course possible that this calculation is considerably in error, but
it seems more likely that the observed band centred at 412 is an overtone of this
same vibration rather than the fundamental. In a harmonic vibration the transitions
0↔1, 1↔2, 2↔3, etc., all give the same spectrum frequency, but in a fourth-power
vibration these frequencies are in the ratio 1 : 1.32 : 1.53 : 1.69 : 1.82, etc. (Bell 1944).
If the lowest frequency is in fact 266, this would produce a series of bands at 266,
350, 407, 450, 484, etc., of which the third agrees well with the observed frequency
of 412. Moreover, it is easy to show that the intensities of the first two overtones
will be comparable with that of the fundamental. In a given mass of gas the
integrated intensity of an absorption band for the transition m→n is proportional to
$(x_{mn})^2\,\nu_{mn}\exp(-E_m/kT)$, where $E_m$ is the energy of the mth level, and
$x_{mn}$ is the matrix element $\int \psi_m \psi_n\, x\, dx$ (Bethe 1933). Approximate
values of x₀₁, x₁₂ and x₂₃ can be computed from the wave functions previously
derived (Bell 1944), giving 1 : 0.69 : 0.21 for the relative intensities of the bands
at 266, 350 and 407. Absorption decreases slowly on the short-wave side of the
observed band and is still perceptible at 500 cm.⁻¹: this can be attributed to the
superposition of bands of decreasing




intensity corresponding to the transitions 3→4, 4→5, etc. The apparently parallel
nature of the observed band could easily arise from this complex structure in
conjunction with the isotope effect.

The details of this interpretation could only be satisfactorily tested by
observations with higher resolving power and at lower frequencies. For example, if the
lowest frequency were 310 cm.⁻¹ instead of 266 cm.⁻¹ the observed band at 412
would be the first rather than the second overtone. On the other hand, even if we
attach no weight to the calculated value of 266 it does not seem likely that 412 is
a fundamental, since such an assumption would predict specific heats which are
considerably too low over the whole temperature range (cf. next section).
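The band positions implied by the fourth-power ratios can be tabulated for any assumed lowest frequency. The sketch below uses only the ratios quoted from Bell (1944); the function names are our own, and because the ratios are given to two decimals the second member comes out at about 351 rather than the 350 quoted above.

```python
# Ratios of successive transition frequencies (0-1, 1-2, 2-3, ...) for a
# fourth-power vibration, as quoted in the text from Bell (1944).
QUARTIC_RATIOS = (1.0, 1.32, 1.53, 1.69, 1.82)


def band_series(lowest_cm):
    """Band positions (cm^-1, rounded) for a given lowest spectrum frequency."""
    return [round(lowest_cm * ratio) for ratio in QUARTIC_RATIOS]


def closest_transition(lowest_cm, observed_cm=412.0):
    """Index i of the transition i -> i+1 lying nearest the observed band."""
    series = [lowest_cm * ratio for ratio in QUARTIC_RATIOS]
    index = min(range(len(series)), key=lambda i: abs(series[i] - observed_cm))
    return index, series[index]


for lowest in (266.0, 310.0):
    index, position = closest_transition(lowest)
    print(lowest, band_series(lowest), f"412 matches transition {index}->{index + 1}")
```

With 266 cm.⁻¹ the 412 band falls on the 2→3 transition (the second overtone in the sense used above), and with 310 cm.⁻¹ on the 1→2 transition, which is the alternative considered in the text.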

The calculated values for ν₅, ν₆ and ν₁₀ lead to some additional assignments
for combination tones, e.g.

    1008 + 266* = 1274 }
    1540 - 266* = 1274 }   (obs. 1292)
    1608 - 266* = 1342     (obs. 1377)
     821 + 890  = 1711     (obs. 1722)
    1863 + 266* = 2129     (obs. 2134)
    2102 + 266  = 2364     (obs. 2353)


In the cases marked * the selection rule for B1u is violated, but, as already indicated,
the fourth-power vibration ν₁₀ does not strictly belong to this symmetry class. It
may be observed that the assumption ν₁₀ = 310 (with the observed 412 as its first
overtone) leads to a somewhat more satisfactory interpretation of combination
frequencies.

The specific heat of diborane 

The specific heat of gaseous diborane has been measured over the temperature
range 100-300° K (Stitt 1940). In order to account for the observed values on the
basis of the ethane structure it was necessary to assume that the rotation of the
-BH₃ groups is restricted by a barrier of either 5000 or ca. 15,000 cal./mol., according
to the assumptions made about the supposed low-lying electronic states (Stitt 1941).
Both figures are higher than the value usually accepted for ethane (ca. 3000 cal./mol.),
which is surprising, since the hydrogen atoms are further apart in diborane than in
ethane, and it is unlikely that the B-B link would have any double-bond character.

It is therefore of interest to use the frequencies in table 3 to calculate the specific
heat of the bridge structure. This can be done without introducing any adjustable
parameters, since the calculated values can be used for the inactive frequency ν₅
and the unobserved frequencies ν₆ and ν₁₅. The standard harmonic oscillator
expression has been used for all the frequencies except the fourth-power vibration ν₁₀.
The contribution of this vibration was calculated as previously described (Bell
1944), assuming the lowest spectrum frequency to be 266 cm.⁻¹. The values obtained
are given in table 4.
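The harmonic part of this calculation can be sketched as follows. This is an illustration under stated assumptions, not the authors' working: the constants R and hc/k are modern values, the gas is treated as ideal with classical translation and rotation (3R in all), observed frequencies are used where available with the calculated values for ν₅, ν₆ and ν₁₅, and the fourth-power vibration ν₁₀ is deliberately left out because its contribution requires the separate treatment of Bell (1944).

```python
import math

R_CAL = 1.987       # gas constant in cal/(mol K); assumed modern value
HC_OVER_K = 1.4388  # hc/k in cm K; assumed modern value

# The seventeen harmonic fundamentals of diborane (cm^-1) from table 3:
# observed values, with calculated values for nu5 (890), nu6 (1540), nu15 (1505).
HARMONIC_CM = [2523, 2102, 1180, 792, 890, 1540, 1008, 2625, 1405,
               2489, 821, 1861, 981, 1505, 2558, 1608, 1178]


def einstein_cv(nu_cm, temp_k):
    """Harmonic-oscillator (Einstein) heat capacity of one mode, cal/(mol K)."""
    x = HC_OVER_K * nu_cm / temp_k
    return R_CAL * x * x * math.exp(x) / math.expm1(x) ** 2


def cv_without_nu10(temp_k):
    """c_v from translation, rotation and the harmonic modes only (nu10 omitted)."""
    rigid = 3.0 * R_CAL  # 3/2 R translation + 3/2 R rotation
    return rigid + sum(einstein_cv(nu, temp_k) for nu in HARMONIC_CM)


for temp in (100.0, 150.0, 200.0, 250.0, 300.0):
    print(f"T = {temp:5.0f} K   c_v (nu10 omitted) = {cv_without_nu10(temp):.2f}")
```

The difference between these figures and the calculated row of table 4 is then the share attributed to ν₁₀. At 300° K it comes to roughly 1.4 to 1.5 cal./mol. deg., which is close to the classical limit of ¾R for a purely quartic oscillator.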




The agreement is satisfactory, except at the highest temperature, where the
calculated specific heat is somewhat too low. Part of this discrepancy is probably due
to neglect of anharmonicity, which would produce too low values for c_v.
Anharmonicity is particularly marked in hydrogen compounds and is likely to produce an
appreciable correction even at 300° K for the eight frequencies below 1200 cm.⁻¹.
A correction corresponding to an effective lowering of each of these frequencies by
5 % would be sufficient to remove the discrepancy at 300° K. It is of course also
possible that the unobserved twisting frequencies ν₅ and ν₁₅ are not correctly
predicted by the angular force constant d₄ derived from other types of vibration.
If these two frequencies are reduced by about %o % the observed specific heats are 
reproduced within experimental error without making any anharmonicity correction. 


Table 4. Specific heat of diborane

T (°K)                100           150           200           250           300
c_v (cal./mol.) obs.  6.35 ± 0.2    6.90 ± 0.2    8.15 ± 0.1    9.80 ± 0.1    11.60 ± 0.2
c_v (cal./mol.) calc. 6.57          7.15          8.06          9.49          10.80


The valency structure of diborane 

The last two sections can be summed up by saying that the observed vibrational
spectrum of diborane is in excellent qualitative and quantitative agreement
with the predictions of the bridge model, while the specific heat results are
compatible with the same model. This strengthens the evidence previously given
(Longuet-Higgins & Bell 1943) in favour of the bridge configuration of diborane.
The values of the force constants given in table 3 also provide material for speculation
as to the nature of the links involved in the bridge structure.

The terminal B-H stretching vibrations are well represented by the same force
constant f₂ used by Crawford & Edsall (1939) for triborine triamine, thus confirming
the assumption that the terminal links are normal single bonds. On the other hand,
the constant f₁, referring to the B-H′ link in the bridge, is less than half as great as
f₂, as would be expected if the bridge linkages are one-electron or resonance bonds.
According to the relation proposed by Douglas Clark (1934) the ratio of the distances
B-H and B-H′ should be (3.42/1.43)^(1/6) = 1.16, agreeing within experimental error
with the electron-diffraction value of 1.39/1.18 = 1.18. The bending force constants
cannot be directly compared with any data, but d₄ and d₂ are close to the values
0.35 and 0.23 given by Crawford & Edsall for planar and non-planar bending in the
system H-B-N. On the other hand the constant d₁ is considerably smaller, in harmony
with the fact that it refers to the angle between two B-H′ links, which (whatever
their exact formulation) contain only two electrons between them.
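The comparison of bond lengths with force constants is a one-line calculation. The sketch below assumes that the Douglas Clark relation takes the form (force constant) × (distance)⁶ = constant, which is the reading that reproduces the figure of 1.16 quoted above; the variable names are our own.

```python
# Force constants in units of 10^5 dynes/cm (table 3) and distances in angstroms.
F_TERMINAL = 3.42  # f2, terminal B-H link
F_BRIDGE = 1.43    # f1, bridge B-H' link
A_BRIDGE = 1.39    # a, B-H' distance from electron diffraction
B_TERMINAL = 1.18  # b, terminal B-H distance from electron diffraction

# Assumed form of the relation: k * r**6 = constant, so the distance ratio
# is the sixth root of the inverse force-constant ratio.
predicted_ratio = (F_TERMINAL / F_BRIDGE) ** (1.0 / 6.0)
measured_ratio = A_BRIDGE / B_TERMINAL

print(f"predicted r(B-H')/r(B-H) = {predicted_ratio:.2f}")
print(f"measured  r(B-H')/r(B-H) = {measured_ratio:.2f}")
```

The two ratios differ by about 2 %, well inside the ± 5 % uncertainty suggested earlier for the electron-diffraction dimensions.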

The value obtained for g₁ (representing interaction between the boron atoms)
is an upper limit, since it involves the assumptions g₂ = d₃ = 0, but its value cannot
be less than about 1.5 × 10⁵ dynes/cm. on any reasonable set of assumptions. In
any event, g₁ is less than half as great as would be expected for a B-B link, though





it seems rather large to represent an interaction between two unlinked atoms. If
the bridge structure [structural formula] is to be represented as a resonance hybrid
of normal valency states, the relatively high value of g₁ could be related to the
occurrence of structures such as [structural formula] among these states. However,
it should be noted

that in a recent analysis of the vibrational spectrum of cyclo-butane (Wilson 1943)
the potential function derived contains large cross-terms which are physically
equivalent to interactions between unlinked atoms. The diagonal distance in the
cyclo-butane ring is 2.16 Å (compared with 1.79 Å in diborane), and it would not
be generally supposed that the structure of this molecule involves any resonance
states with charges on opposite carbon atoms. In our opinion, therefore, the force
constants which we have derived do not provide any evidence for the predominance
of ionic states in the diborane molecule, as proposed by Sirkin & Diatkina (1941).
The whole question is to some extent a formal rather than a real one, since the
representation of the bridge linkage as a resonance hybrid is at best an artificial
way of representing the actual electron distribution. A better picture of the true
state of affairs might be obtained by treating the four-electron system
[structural formula] by the molecular orbital method, which has not yet been
attempted.

It is also possible to estimate a lower limit for the energy absorbed in the
hypothetical reaction B₂H₆ → 2BH₃. Experiment shows that although diborane is a
very reactive substance at ordinary temperatures and often produces compounds
containing the BH₃ group, its vapour density indicates no signs of dissociation into
2BH₃. It is therefore safe to assume that p(BH₃)²/p(B₂H₆) < 10⁻³ atm. at 300° K, i.e.
ΔF₃₀₀ > 4.2 kcal./mol. On the other hand, we can estimate the value of ΔF on
theoretical grounds. In the molecule B₂H₆ ν₁₀ is the only frequency low enough to
contribute appreciably to the free energy at room temperature, and in BH₃ the
vibrational contribution can presumably be neglected. Thus in the reaction
B₂H₆ → 2BH₃ three translational and three rotational degrees of freedom are formed
from six vibrations, five of which contribute nothing to the free energy. The value
of ΔF can then be derived from the standard expressions, using the dimensions
already given for B₂H₆ and assuming BH₃ to be a planar molecule with B-H = 1.18 Å.
The result is ΔF₃₀₀ - ΔE₀ = -9.4 kcal./mol., where ΔE₀ is the energy change at
absolute zero. Combining this with the experimental result for ΔF we find
ΔE₀ > 13.6 kcal./mol. as a lower limit for the energy of dissociation of B₂H₆ into 2BH₃.
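The arithmetic behind this lower limit can be laid out explicitly. The sketch below assumes the usual relation ΔF = -RT ln K with pressures in atmospheres and a modern value of R; the function name is our own, and the statistical term of -9.4 kcal./mol. is simply taken from the text rather than recomputed.

```python
import math

R_KCAL = 1.987e-3  # gas constant in kcal/(mol K); assumed modern value


def delta_f_lower_bound(k_upper_atm, temp_k):
    """Lower bound on Delta F (kcal/mol) when the equilibrium constant is below k_upper_atm."""
    return -R_KCAL * temp_k * math.log(k_upper_atm)


delta_f_300 = delta_f_lower_bound(1.0e-3, 300.0)

# Delta F(300) - Delta E(0) = -9.4 kcal/mol is the value quoted in the text.
STATISTICAL_TERM = -9.4
delta_e0_bound = delta_f_300 - STATISTICAL_TERM

print(f"Delta F(300) > {delta_f_300:.1f} kcal/mol")
print(f"Delta E(0)   > {delta_e0_bound:.1f} kcal/mol")
```

With these constants the bounds come out within about 0.1 kcal./mol. of the 4.2 and 13.6 quoted above; the small difference presumably reflects rounding or the constants used in the original.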


The Raman spectra of the aluminium halides 


These spectra have been measured in the fused state by Gerding & Smit (1941)
and by Rosenbaum (1940). Their data are given in table 5. Since the infra-red
frequencies are not available, it is impossible to carry out such a detailed analysis
as for diborane, but a reasonably certain assignment can be made by using a
simplified form of the valency-force equations. We have calculated the Raman
frequencies of classes A1g, B1g, B2g and B3g from equations (3) assuming throughout

a = b, 2θ₁ = 2θ₂ = 109°,

A = A = - lOdj. = 10d 2 = 104 = 1-6 x 10 5 , 4 - 0. 

The values obtained are given in table 6, together with the suggested assignments
of the observed frequencies. There is of course no exact agreement between the
observed and calculated frequencies, but the general pattern of the observed spectra
is well reproduced by the simplified equations. The trend of the values shows that
better agreement would be obtained if the force constants were assumed to decrease
in passing along the series Al₂Cl₆, Al₂Br₆, Al₂I₆; this is what would be expected on
chemical grounds.


Table 5. Observed Raman frequencies for the aluminium halides

(Figures in parentheses represent intensity and state of polarization;
a lone ? marks an intensity that cannot be read)

Al₂Cl₆            Al₂Br₆         Al₂I₆
112 (6, dp?)      73 (6)         53 (6)
164 (3, dp)       112 (3)        94 (6)
217 (5, p)        140 (5)        146 (10)
284 (2, dp)       176 (2)        195 (½)
340 (10, p)       204 (10)       344 (?)
438 (½, dp?)      221 (½)        406 (3)
506 (3, p)        291 (½)
606 (?, dp)       407 (2½)
                  491 (3)

Table 6. Assignments and calculated Raman frequencies
for the aluminium halides

                 Al₂Cl₆           Al₂Br₆            Al₂I₆
class   mode   calc.   obs.    calc.   obs.      calc.   obs.
A1g     ν₁     519     606     485     491       476     406
        ν₂     341     340     231     204       188     146
        ν₃     200     217     144     140       107     94
        ν₄     120     112     76      73        54      53
B1g     ν₆     453     438     434     (291?)    430     (195?)
        ν₇     189     284     129     176       103     —
B2g     ν₁₁    484     506     449     407       438     344
        ν₁₂    137     164     96      112       103     —
B3g     ν₁₅    180     —       120     —         95      —


In each case the four most intense lines are assigned to the totally symmetrical
class A1g, and the twisting frequency ν₁₅ is not observed, as was also found to be the
case for B₂H₆. The only observed frequency not included in table 6 is the weak line
of Al₂Br₆ at 221, which may be a combination tone 140 + 73 = 213. (The
corresponding combination tones for Al₂Cl₆ and Al₂I₆ would coincide almost exactly
with the fundamentals 340 and 146 respectively.) It is also likely that the weak
lines 291 (Al₂Br₆) and 195 (Al₂I₆) are not fundamentals, since they differ greatly
from the calculated values, and can be represented as 176 + 112 = 288 and
146 + 53 = 199 respectively. The true fundamentals of ν₆ for Al₂Br₆ and Al₂I₆ may
be obscured by ν₁₁.
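The suggested combination tones can be set against the observed weak lines in a few lines of code. This is only a bookkeeping sketch: the dictionary layout and function name are our own, and the frequencies are those of tables 5 and 6.

```python
# Observed weak Raman lines (cm^-1) and the pairs of fundamentals proposed
# in the text as their combination tones.
PROPOSED = {
    ("Al2Br6", 221): (140, 73),
    ("Al2Br6", 291): (176, 112),
    ("Al2I6", 195): (146, 53),
}


def combination_mismatch(observed, pair):
    """Observed frequency minus the simple sum of the two fundamentals."""
    return observed - sum(pair)


for (halide, observed), pair in PROPOSED.items():
    gap = combination_mismatch(observed, pair)
    print(f"{halide}: {pair[0]} + {pair[1]} = {sum(pair)}  (obs. {observed}, diff. {gap:+d})")
```

The differences are all under 10 cm.⁻¹, which is small compared with the discrepancies of well over 100 cm.⁻¹ that result if 291 and 195 are taken as the ν₆ fundamentals.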

The only unsatisfactory feature of the above scheme is that the lines assigned to
ν₁ and ν₁₁ for Al₂Cl₆ are described respectively as depolarized and polarized, while
in theory the reverse should be the case (cf. table 1). The degree of polarization
recorded for 506 is only 0.3, so that this objection is not a decisive one, but it is
possible that the assignments of ν₁ and ν₁₁ should be reversed.

Kohlrausch & Wagner (1942) have given assignments for the aluminium halides
which differ considerably from ours, though they were arrived at by a similar
treatment. This is because three of their four frequency equations contain errors, which
led them to suppose that ν₁₂ was not observed, and to interpret two of the strong
observed lines as combination tones.* Kohlrausch & Wagner have also measured
the Raman spectra of C₂Cl₆ and aluminium trimethyl. If the C₂Cl₆ spectrum is
compared with those of the aluminium halides, the differences in general pattern
are so marked as to exclude an ethane-like structure for the latter: this agrees with
the results of electron-diffraction measurements (Palmer & Elliott 1938; Brode 1940).
In the spectrum of aluminium trimethyl the higher frequencies can be attributed to
internal motions of the CH₃ groups, while the low-frequency end of the spectrum
differs considerably from that of C₂Cl₆ and shows some resemblance to the aluminium
halide spectra. Kohlrausch & Wagner have concluded that dimeric aluminium
trimethyl has the bridge structure Me₂Al(Me)₂AlMe₂, with two bridging methyl
groups, like the dimeric halides. However, in this case, electron-diffraction
measurements (Brockway & Davidson 1941)
are definitely in favour of an ethane rather than a bridge structure. Moreover, the
evidence for the existence of a definite species Al₂Me₆ is not strong, since although
the measured vapour density (Laubengayer & Gilliam 1941) corresponds roughly to
dimerization at the lowest temperature used, the plot of apparent molecular weight
against temperature has not flattened out, suggesting the existence of higher
polymers. We have therefore not attempted to interpret the spectrum of this compound.


Our best thanks are due to Dr J. W. Linnett for his advice and assistance. 


References 

Anderson, T. F. & Burg, A. B. 1938 J. Chem. Phys. 6, 586.

Bauer, S. H. 1942 Chem. Rev. 31, 43.

Bell, R. P. 1944 Proc. Roy. Soc. A, 183, 328.

Bethe, H. 1933 Handbuch d. Physik, 24, 434.

Brockway, L. O. & Davidson, N. R. 1941 J. Amer. Chem. Soc. 63, 3287.

Brode, A. 1940 Ann. Phys., Lpz., (5), 37, 344.

* Two of the three incorrect equations were subsequently corrected (Wagner 1943 ), but 
no revision was made in the calculated frequencies or in the assignment proposed. 



Crawford, B. L. & Edsall, J. T. 1939 J. Chem. Phys. 7, 223.

Diatkina, M. E. & Sirkin, J. K. 1943 C.R. Acad. Sci. U.R.S.S. 35, 180.

Douglas Clark, C. H. 1934 Phil. Mag. (7), 18, 459.

Gerding, H. & Smit, E. 1941 Z. Phys. Chem. B, 52, 171.

Gerhard, S. L. & Dennison, D. M. 1933 Phys. Rev. 43, 197.

King, G. W. 1938 J. Chem. Phys. 6, 378.

Kohlrausch, K. W. F. & Wagner, J. 1942 Z. Phys. Chem. B, 52, 185.

Laubengayer, A. W. & Gilliam, W. F. 1941 J. Amer. Chem. Soc. 63, 477.

Linnett, J. W. 1938 J. Chem. Phys. 6, 692.

Longuet-Higgins, H. C. & Bell, R. P. 1943 J. Chem. Soc. p. 250.

Palmer, K. J. & Elliott, N. 1938 J. Amer. Chem. Soc. 60, 1852.

Placzek, G. 1934 Handbuch der Radiologie, 6, part 2.

Rosenbaum, E. J. 1940 J. Chem. Phys. 8, 643.

Sirkin, J. K. & Diatkina, M. E. 1941 Acta Physicochim. U.R.S.S. 14, 547.

Stitt, F. 1940 J. Chem. Phys. 8, 981.

Stitt, F. 1941 J. Chem. Phys. 9, 780.

Wagner, J. 1943 Z. Phys. Chem. B, 53, 85.

Wiberg, E. & Johannsen, T. 1942 Angew. Chem. 55, 38.

Wilson, T. P. 1943 J. Chem. Phys. 11, 369.


The dissociation energies of CO, N₂, NO and CN

By A. G. Gaydon and W. G. Penney 
Imperial College of Science and Technology, S.W. 7

(Communicated by Sir Alfred Egerton, Sec.R.S.—Received 30 March 1944) 

No previous set of values of the dissociation energies of CO, N₂, NO and CN can be regarded
as satisfactory unless violations of the non-crossing rule are postulated as possible.

The Birge-Sponer method of extrapolating vibrational energy levels to the dissociation
limit is discussed. By assuming that the intensity variations in the Fourth Positive bands of
CO, usually regarded as due to a predissociation, are really perturbations, and by adopting an
alternative interpretation to the commonly accepted one for the predissociation in the First
Positive bands of N₂, it is possible to reconcile existing data with both the non-crossing rule
and the Birge-Sponer extrapolations only with the following values: D(CO) = 11.11 eV,
D(N₂) = 9.764, D(NO) = 6.49, D(CN) = 7.5. The corresponding value of the latent heat of
sublimation of carbon is 170.1 kcal./mol. This scheme of values explains the results of related
experimental measurements at least as well as any other.


Introduction 

The energies of dissociation of a number of diatomic molecules have been
determined, apparently with high accuracy, by observation of predissociation limits.
In some cases, the reliability of the values obtained depends on the validity of
certain assumptions concerning the degree of excitation of the products of
dissociation, and in some cases the values at present accepted are very different from the
values expected from extrapolation of known vibrational energy levels to the
dissociation limit.

The recent history of the supposed determinations of the energy of dissociation
of carbon monoxide, a molecule whose spectrum is well known and has been the
subject of intensive study, illustrates how difficult it has proved to reach agreement.
The value for D(CO) originally put forward by Birge & Sponer (1926) was, with
recent conversion factors, 11.34 eV. Lessheim & Samuel (1934) assumed that
there was a predissociation in the upper level of the 5B bands and gave D(CO) =
10.45 eV. Then Coster & Brons (1934) reported predissociation in the spectra
of CO leading to D(CO) = 9.82 eV, but later (Brons 1935; Coster & Brons 1935)
claimed to have observed predissociation in the A ¹Π upper level of the Fourth
Positive bands and said that 'this made D(CO) = 8.41 eV perfectly certain'.
This was at first accepted by Schmid & Gero (1935) who later, however, appeared
convinced that the true value was 6.92 eV (Schmid 1936; Schmid & Gero 1936).
Herzberg (1937) reviewed the existing observations of predissociations and
concluded that, with recent conversion factors, values of 6.92, 7.88, 8.42, 9.14, 9.85
or 11.11 eV were possible according to the genuineness and interpretation of the
observed limits, and summarized in favour of 9.14, a value which he has recently
(1942) defended. Calculations on the short-wave fluorescence limits of benzene
and of the absorption spectra of polyenes (Kynch & Penney 1941) favour a value
about 11 eV, while electron collision experiments (Hagstrum & Tate 1941) gave
9.6, favouring a value of 9.85 if the predissociation limits are taken into account.
From this remarkable collection of diverse values it does not seem possible to draw
any safe conclusion, except perhaps that the whole position is unsatisfactory. For
nitrogen a rather similar history could be written, but in this case a value of 7.38 eV
appears to have been generally accepted, although it is in strong disagreement with
the value 11 eV, obtained by extrapolation of the vibrational levels of the ground
state, a value which can hardly be reduced by more than 2 eV on any normal
scheme of extrapolation.

Quite apart from the obvious doubt engendered by the variety of values put
forward, it is remarkable that most authors have ignored the non-crossing rule
introduced by Hund, and stressed by Neumann & Wigner (1929), that potential
energy curves of the same electronic species never cross. We therefore propose to
apply the non-crossing rule systematically and see where it leads. As a preliminary
to a discussion of individual molecules, a few brief general remarks will serve to
express our views on the reliability of extrapolated dissociation limits and on the
application of the rule. It is then proposed to discuss the dissociation energies of
certain molecules affected by our conclusions. These energies are of considerable
importance in thermo-chemistry, and the value of D(CO) carries with it, as described
by Herzberg (1937), a knowledge of the heat of sublimation of carbon, and is therefore
of cardinal importance in determining the absolute heats of formation of all organic
compounds.


The Birge-Sponer extrapolation

When a number of vibrational energy levels of a molecular state are known, it is
possible to extrapolate these levels to the convergence limit correlated with the
energy of dissociation of that particular molecular state. The extrapolation may be
made either analytically or graphically. One of the simplest methods is to plot the
intervals ΔG(v+½) between the vibrational energy levels against the vibrational
quantum number v and extrapolate the curve to zero ΔG(v+½). The dissociation
energy is then given by the area beneath the curve. Alternatively the vibrational
energy term G(v) may be plotted against ΔG(v); the intercept on the G(v) axis obtained
by extrapolation then gives the dissociation limit. The accuracy of the
extrapolation naturally depends on how many vibrational levels are known, and where only
a few levels are known, so that a long extrapolation has to be made, the accuracy
may not be high; but where a number of levels are known so that the general trend
of the curve can be seen and allowed for, reasonably accurate values may be
expected.
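The first of these procedures can be sketched for the simplest case, a straight-line extrapolation. Everything in the example is an illustrative assumption rather than data from this paper: the levels are generated from a Morse-type formula with invented constants (2000 and 15 cm.⁻¹), the intervals are plotted against v + ½, and a least-squares line is used in place of a hand-drawn curve.

```python
def morse_levels(omega_e, omega_e_x_e, count):
    """Synthetic vibrational terms G(v) = we(v+1/2) - wexe(v+1/2)^2 in cm^-1."""
    return [omega_e * (v + 0.5) - omega_e_x_e * (v + 0.5) ** 2 for v in range(count)]


def birge_sponer_d0(levels):
    """Linear Birge-Sponer estimate of D0: area under the fitted interval line."""
    gaps = [upper - lower for lower, upper in zip(levels, levels[1:])]
    xs = [v + 0.5 for v in range(len(gaps))]  # each interval plotted at v + 1/2
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(gaps) / n
    slope = (sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, gaps))
             / sum((x - mean_x) ** 2 for x in xs))
    intercept = mean_y - slope * mean_x
    if slope >= 0:
        raise ValueError("intervals do not converge; no extrapolation possible")
    x_limit = -intercept / slope          # where the fitted interval falls to zero
    return 0.5 * intercept * x_limit      # area of the triangle under the line


# Hypothetical constants, chosen only to exercise the method.
levels = morse_levels(2000.0, 15.0, 10)
print(f"extrapolated D0 = {birge_sponer_d0(levels):.1f} cm^-1")
```

For levels that follow the two-constant formula exactly the triangle reproduces the true dissociation energy, so any error in practice comes from curvature of the real interval curve beyond the last observed level, which is the point taken up in the paragraphs that follow.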

In a detailed study of the vibrational levels of the upper electronic state of O₂
Birge (1929) found that the ΔG(v+½)-v curve showed an inflexion at high values of v,
giving a slightly higher value for the dissociation limit than that obtained by
extrapolation using only the lower levels. The second difference in ΔG(v+½) showed
a fairly sharp break, and Birge represented the curve for this second difference by
two intersecting straight lines. More recent measurements by Knauss & Ballard,
however, indicate that the change of slope is less sudden than was formerly
supposed. Similar changes leading to slightly higher values of the dissociation energy
have been observed for the upper electronic states of the halogens, ICl and IBr. The
upper electronic states of the alkali-metal hydrides are also anomalous, due to
interaction with ionic terms, but do not show any sharp discontinuity. Watson &
Koontz (1934) have reported a break in the values for the second difference in
ΔG(v+½) for the upper state of the far ultra-violet system of N₂, which would in this
case lead to a slightly lower value of the dissociation limit. However, all these
discontinuities are slight and would not appreciably affect the determination of
the limit.

For AgH (Bengtsson & Olsson 1931) the curve for the excited electronic state
shows a pronounced change of slope which leads to a much higher dissociation
limit than that which would be obtained by extrapolation from the first few
levels.

The ΔG(v+½)-v curve for the upper state of the red bands of K₂ studied by
Loomis & Nusbaum (1932) shows a rather marked change of slope at high v,
leading to a lower dissociation limit than that which would be obtained by
extrapolation of the first part of the curve. Many of the weak bands from which data
for the highest vibrational levels are derived are blended, while others could be
analysed so as to form parts of weaker Franck-Condon parabolas of the type
observed by Gaydon & Pearse (1939) for RbH. Thus it is hardly safe to rely on
this example of a break in the curve towards a lower limit. There are, however,
indications of similar changes in the curves for the excited states of other
alkali-metals.

It should be noted that all the discontinuities discussed above refer to excited
states. The authors are unaware of any evidence for a sharp change of slope in





The dissociation energies of CO, N 2 , NO and CN 

the curve for any ground electronic state, although a large number of such states
have been examined. For some molecules, especially CO and N₂, many of the
higher electronic states appear to possess only a few vibrational levels. In such
cases, the vibrational levels seem to terminate abruptly, probably owing to
predissociation caused by crossing of a repulsive potential curve, and they do not
converge to a limit in the usual way.

We conclude that there is little evidence that the extrapolation to the dissocia¬ 
tion limit, where it is made with reasonable care from a fair number of vibrational 
levels, is likely to be seriously in error for ground electronic states. The possibility 
of abrupt termination of vibrational levels due to crossing of the potential curve 
with a repulsive potential curve must however be borne in mind, especially for 
excited electronic states. 


The non-crossing rule

According to the non-crossing rule of Hund, and Neumann & Wigner ( 1929 ), 
potential energy curves of the same electronic species which would to a first 
approximation cross each other, will, in higher approximation, avoid this crossing. 

Two cases of avoided crossing are illustrated in figure 1 (a) and (b). The broken
lines indicate the potential curves in first approximation where they cross, while
the full-line curves represent the higher approximation where the crossing is
avoided by the repulsion between the two curves.




Figure 1

It will be seen that where one of the curves, in first approximation, is repulsive, 
there is the possibility of the lower curve showing a maximum as well as the usual 
minimum characteristic of a stable state. We suspect that such maxima may be 
fairly common for molecules like CO and N₂ where we have evidence from the
abrupt termination of vibrational levels that, at any rate among the higher 



378 A. G. Gaydon and W. G. Penney 

electronic states, there is considerable interference between the numerous potential 
energy curves. 

It should be noted that in an avoided crossing of the type illustrated in figure
1 (a) there is little chance of attributing vibrational levels of the upper curve to a
continuation of the vibrational levels of the lower curve and so failing to
distinguish between the molecule in the two states as, in general, the rotational
constants and the internuclear distance will not vary smoothly in passing from one 
electronic state to the other; the value of B will usually be appreciably smaller for 
vibrational levels of the upper arm than for those of the lower. In the actual 
neighbourhood of an attempted crossing the concept of potential curves probably 
has little meaning and only the energy levels may be regarded as real, but away 
from the actual crossing and its accompanying perturbations we may expect the 
vibrational and rotational energy levels to behave normally with different constants 
for the two states. 

In the case of a potential maximum, as illustrated in figure 1 (b), vibrational
levels may exist above the dissociation limit, that is between the levels A and B 
in the figure. Molecules in these levels will therefore be in a metastable state and 
may be able to predissociate spontaneously. However, it does not always follow 
that their life will be short as, if the potential maximum is broad, there will be 
little overlap between the wave functions corresponding to the vibrational levels 
and the wave functions corresponding to the dissociated condition, and passage 
through the potential barrier may be very rare. Thus levels not much above A may 
behave essentially normally and strong transitions to and from these levels may be 
observed in the spectrum. 

As far as we are aware there is only one example, CaH, in which there are actual
data indicating crossing of potential energy curves of the same species, and this
appears to be due to an incorrect interpretation of the spectrum rather than to a
violation of the rule.* For CO, N₂, NO and CN, however, the use of low values
for the dissociation energy leads to a violation of the rule because known potential
curves would have to cross repulsive curves of the same species arising from the
lowest dissociation limits.

In considering the non-crossing rule in relation to the determination of
dissociation limits we may well quote the caution given by Mulliken (1937):

‘... it would be wise to keep in mind the possibility of avoided crossing with
fulfilment of the non-crossing rule and accompanied by maxima in the potential

* For CaH, Grundström shows potential curves (reproduced by Herzberg 1939) in which
that for the D ²Σ⁺ state crosses that for C ²Σ⁺. In Grundström's paper (1932) on the D system,
several of the bands are found to show two extra branches (referred to as P* and P*) which
cannot be accounted for on the existing analysis attributing the bands to a ²Σ → ²Σ transition.
One of us (A. G. G.) has rephotographed the (1,1) band in the 2nd order of a 20 ft. concave
grating spectrograph. The existence of the P* and P* branches is confirmed. It is also noted
that the lines denoted P₁(17) and 2^(15) by Grundström are in reality double. This probably
indicates that all the lines are really unresolved doublets, but only in one case, owing to
perturbation, is the doubling visible. The transition is therefore probably ⁴Σ → ²Σ and not ²Σ → ²Σ.
Thus the D level is not of the same species as the C level.




energy curves of the states under consideration. It is likely that maxima, small or
large, will prove to be more common than has been supposed.

‘If so, some of the existing conclusions as to dissociation energies based on
predissociation phenomena or on convergence of vibrational series to a limit may need
revision.

‘... Even in cases such as O₂, N₂ and NO there remains a shadow of doubt as
to the reliability or at least the accuracy of the accepted values. For instance, the
existence of hitherto unsuspected maxima in potential curves might affect some
of these values; also, the use of predissociation data appears to be somewhat risky
unless great caution has been used.’


The dissociation of CO

The history of the determinations of D(CO) has been given briefly in the
introduction. The b ¹Σ upper level of the Ångström bands and the b ³Σ upper level of
the Third Positive bands both show definite predissociation at 11.11 eV, and this
is certainly an upper limit for D(CO). Gerö (1936) reported predissociation in bands
of the Fourth Positive system with v′ = 9 and 10. Herzberg (1937) has discussed
the three possibilities: (a) that this predissociation is real and corresponds to a real
limit, (b) that it is real but is accidental and does not correspond to a dissociation
limit, and (c) that the predissociation is not real at all but is only a perturbation.
The first possibility gives D(CO) = 6.92 eV, a value advocated by Schmid and Gerö;
the second possibility gives 9.14 eV, the value which Herzberg favours; while the
third possibility, taken with the assumption that the predissociation at 11.11 eV is
actually a limit, gives D(CO) as either 11.11 or 9.85 eV. Gerö's published photograph
showing this predissociation in the Fourth Positive bands, and our own
spectrograms taken on a Hilger E.1 quartz Littrow spectrograph, seem to us to favour
the third possibility, that the effect observed is only a perturbation and not a
predissociation. Further, the superposition of a ‘3A’ band on the (9, 18) Fourth
Positive band (Gerö 1938) results in anomalous intensities of the lines of the latter,
giving an effect resembling predissociation.

Herzberg (1942) has used Faltings, Groth and Harteck's observations on the
photochemical decomposition of CO as an argument in favour of D(CO) = 9.14 eV.
We have already discussed this photodissociation (Gaydon & Penney 1942) and
pointed out that, with due consideration for the initial act of absorption, this does
not rule out 9.85 eV, even if the initial act is direct photodissociation, but that
probably it is a case of induced photodissociation and therefore tells us nothing.
Electron collision experiments indicate D(CO) = 9.6 ± 0.2 eV, but the curve from
which this result is obtained appears to be of complex structure and might be
capable of an alternative explanation. Certain chemical calculations involving the
latent heat of sublimation of carbon, and also Marshall and Norton's direct
determination of the heat of sublimation, favour D(CO) = 11 eV. Extrapolation of the




vibrational levels of CO⁺ favours a value a little over 10 eV (but see footnote added
in proof*).

We now propose to examine the application of the non-crossing rule to the
various potential energy curves for CO. The lowest possible dissociation products
are C(³P) + O(³P). These lead, on the Wigner-Witmer correlation rules, to the
formation of molecular states of the following types:

¹Σ⁺(2), ¹Σ⁻, ¹Π(2), ¹Δ, ³Σ⁺(2), ³Σ⁻, ³Π(2), ³Δ

together with a number of quintet states. Now, if the non-crossing rule is to be
upheld, these molecular states arising from the lowest products of dissociation must
either link up with the various known electronic levels of CO of corresponding type
or must lie wholly below them so as to avoid crossing. An energy level diagram is
shown in figure 2; this gives all the known electronic states of CO with the certainly
known vibrational energy levels indicated for each. For application of the
non-crossing rule, the a′ ³Σ⁺ state is the most interesting. From the analysis of the
Asundi bands, some 12 vibrational levels are known, but by following up
perturbations in the a ¹Π upper state of the Fourth Positive bands and in the b ¹Σ⁺
upper state of the Ångström bands, Schmid & Gerö (1937) have been able to track
the vibrational levels up to v′ = 41 for a′ ³Σ⁺ (the v′ values used in the original
paper need to be raised by 2 units). The vibrational levels are smoothly spaced
up to this point, which is within about 1000 cm⁻¹ of the extrapolated convergence
limit at 11.1 eV. Now the normal dissociation products must result in two ³Σ⁺
states, i.e. two states of the same species as a′ ³Σ⁺. The regularity of the vibrational
levels of this state does not indicate any perturbations such as would result from
an avoided crossing, so that if the rule is to hold a′ ³Σ⁺ must dissociate to normal
products, or alternatively the two potential curves of ³Σ⁺ type from these products
must lie below that for a′ ³Σ⁺. In the latter case there must be two unknown
³Σ⁺ states lying below 50,000 cm⁻¹, and this is inconceivable. It would be just
possible that the curve for a′ ³Σ⁺ might reach a maximum at 11.1 eV, and then
descend to a limit below this point, but the exact coincidence of this extrapolated
limit at 11.1 eV with the energy of the two known strong predissociations is too striking
for this hypothesis to appear tenable. If the non-crossing rule holds there can be
no doubt that D(CO) = 11.1 eV.

It may be noted that with the value for D(CO) favoured by Herzberg (9.14 eV)
the a ¹Π state also violates the non-crossing rule.

* (Footnote added in proof.) A recent letter by Anand (1942), reporting Rydberg series
in the far ultra-violet spectrum of CO, suggests that I(CO) = 14.55 eV; this would give
D(CO⁺) = D(CO) − 3.33 eV. The discrepancy between the extrapolated values for the various
states of CO⁺ and those to be expected using D(CO) = 11.11 eV would thus be reduced appreciably.

It may be noted that while a neutral molecule dissociates to uncharged atoms, an ionized
molecule such as CO⁺ or N₂⁺ dissociates to one charged and one uncharged atom. It would
be expected that the law of force between a charged and an uncharged atom would, at large
internuclear distances, differ from that between two uncharged atoms. Thus it is not unlikely
that for high values of the vibrational quantum number the form of the Birge-Sponer
extrapolation for an ionized molecule may differ from that of an ordinary neutral molecule.






Figure 2. Energy level diagram of CO. The main predissociations are indicated by Pr.
The intensity fluctuations produced by irregularities in the a ¹Π state, which we believe to be
a perturbation and not a predissociation, are indicated by ?P. The data for d ³Π are from
Gerö & Szabó (1939); their analysis of the Triplet system does not, in our opinion, lead to a
satisfactory intensity distribution; the v enumeration for d ³Π probably requires adjustment.














The dissociation of CO⁺

The value of D(CO⁺) can be determined from the cycle

D₀(CO⁺) = D₀(CO) + I(C) − I(CO).

Taking the ionization potential of carbon as 11.22 eV and of carbon monoxide as 14.1 eV
(Hagstrum & Tate 1941) we get D(CO⁺) = D(CO) − 2.9 eV.* In view of the doubt
about CO it seems desirable to see if we can obtain any additional evidence from
a study of the spectrum of CO⁺.
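The cycle is simple enough to check by direct arithmetic. The sketch below uses only the ionization potentials quoted in the text and in the footnote added in proof; the function and variable names are illustrative.

```python
def ion_dissociation_energy(d_neutral, i_atom, i_molecule):
    """D(AB+) = D(AB) + I(A) - I(AB), all quantities in eV."""
    return d_neutral + i_atom - i_molecule


I_C = 11.22                # ionization potential of carbon, eV
I_CO_HAGSTRUM_TATE = 14.1  # Hagstrum & Tate (1941), eV
I_CO_ANAND = 14.55         # Anand (1942), from the footnote added in proof, eV

# Shift D(CO+) - D(CO) for each choice of I(CO)
shift_hagstrum_tate = ion_dissociation_energy(0.0, I_C, I_CO_HAGSTRUM_TATE)
shift_anand = ion_dissociation_energy(0.0, I_C, I_CO_ANAND)

# D(CO+) implied by D(CO) = 11.11 eV and the Hagstrum & Tate value
d_co_plus = ion_dissociation_energy(11.11, I_C, I_CO_HAGSTRUM_TATE)
```

The two shifts come out at about −2.88 and −3.33 eV, so the choice of I(CO) alone moves the expected D(CO⁺) by nearly half an electron-volt.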

Three band systems are known for CO⁺: the First Negative carbon bands b ²Σ
to x ²Σ, the Comet-tail system a ²Π to x ²Σ, and the Baldet-Johnson system b ²Σ
to a ²Π. From the analysis of these systems some thirteen vibrational levels, the
highest with an energy of 25971 cm⁻¹ (= 3.21 eV), are known for the lowest,
x ²Σ, electronic state. These levels show an almost constant second difference, and
Biskamp (1933) has made a linear extrapolation to a limit at 79700 cm⁻¹ = 9.9 eV,
a value which, in view of the length of the extrapolation, may not be of high
accuracy. For the upper b ²Σ state, which lies at an energy of 45652 cm⁻¹ (5.66 eV)
above the ground state, ten vibrational levels with a total energy of 14543 cm⁻¹
(1.80 eV) are known, these extrapolating, according to Biskamp, to 29600 cm⁻¹,
that is about 75252 cm⁻¹ or 9.3 eV above the ground state. The a ²Π state lies
some 15600 cm⁻¹ (= 1.93 eV) above the ground state. Fourteen vibrational energy
levels are known, extending to 45762 cm⁻¹ (= 5.67 eV) above the ground state.
We have extrapolated these vibrational levels to their dissociation limit, and
assuming a normal convergence find a value of about 9.2 eV for this limit.
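The conversions between wavenumbers and electron-volts in this paragraph can be reproduced as follows. The conversion factor is a modern rounded value and is an assumption of this sketch, not a figure taken from the paper; the variable names are illustrative.

```python
CM1_PER_EV = 8065.54  # cm^-1 per eV; modern value, not quoted in the paper


def cm1_to_ev(wavenumber):
    """Convert a term value in cm^-1 to eV."""
    return wavenumber / CM1_PER_EV


# Upper state of the First Negative bands: electronic origin plus the
# extrapolated depth of its vibrational ladder (Biskamp's figures).
b_state_origin = 45652.0
b_state_extrapolated_depth = 29600.0
b_limit_cm1 = b_state_origin + b_state_extrapolated_depth
b_limit_ev = cm1_to_ev(b_limit_cm1)

# Ground state: Biskamp's linear extrapolation
x_limit_ev = cm1_to_ev(79700.0)
```

To one decimal place these give 9.3 eV and 9.9 eV, as quoted, so the two states extrapolate to limits about 0.6 eV apart.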

The lowest dissociation products for CO⁺ are C⁺(²P) and O(³P), and this
combination will produce one ²Σ⁺, two ²Σ⁻, two ²Π and other molecular states of
higher multiplicity and Λ value. There does not appear to be any experimental
evidence as to whether the observed ²Σ states of CO⁺ are + or −. If they were −
then there would be no difficulty in assuming that all the three known states
converged together to normal products at a dissociation limit of around 9 eV, giving
a high value for D(CO). However, it is usually assumed that both ²Σ states are ²Σ⁺,
an assumption which seems more reasonable a priori. If this is so then x ²Σ and
b ²Σ must dissociate to different limits. To avoid crossing, the x ²Σ must dissociate to
C⁺(²P) + O(³P) while b ²Σ must go to C⁺(²P) + O(¹D), some 1.97 eV higher. Taking
the limit for b ²Σ as 9.3 eV this would give D(CO⁺) as 7.3 eV and D(CO) as 10.2 eV; to bring
this limit up to that required to give D(CO) = 11.1 eV it would be necessary to
postulate a fairly considerable break in the curve of ΔG_v against v for b ²Σ, as is in fact
observed for one of the states of N₂⁺. On the other hand the vibrational levels of a ²Π, which
should, on the non-crossing rule, dissociate to normal products, are known up to
5.67 eV and appear, as shown above, to converge to a value of D(CO⁺) a good deal
higher than 7.3 eV. It is probable that for CO⁺ the potential curves for the two


* See footnote on p. 380 . 




known excited electronic states are perturbed by other states and depart from the
conventional form. All that may fairly be deduced is that data obtained from CO⁺
favour a value for D(CO) above 10 eV.


The dissociation energy of N₂

The spectrum of nitrogen shows definite predissociation in bands of the Second
Positive system with v′ = 2, 3 and 4. This leads to a dissociation limit at 12.145 eV.
Van der Ziel (1934, 1937) has also found a weak predissociation in some of the
upper vibrational levels of the B ³Π_g state, which is the initial state of the First
Positive system. This predissociation occurs at 9.844 eV. Both of the components
of the Π levels resulting from Λ doubling are affected by the predissociation, and
the strength of the predissociation increases with rotational quantum number.
This has led van der Ziel to interpret the predissociation as due to a repulsive
Δ state, and this argument leads to D(N₂) = 7.383 eV.

As with CO, this low value of the dissociation energy implies a violation of the
non-crossing rule. Some sixteen vibrational levels are known for the a ³Σ_u⁺ state,
these rising to about 1 eV above the dissociation limit N(⁴S) + N(⁴S), which should
give rise to a ³Σ⁺ state. Similarly, vibrational energy levels for a ¹Π_u and B ³Π_g are
known well above the N(⁴S) + N(²D) limit from which states of these types should
arise. An energy level diagram for N₂ showing this is given by Herzberg (1939),
who has accepted van der Ziel's interpretation of the predissociation.

This violation of the non-crossing rule led us (1942) to doubt the low value for
the dissociation energy. It has now been shown by one of us (Gaydon 1944a) that
there is an alternative interpretation of van der Ziel's predissociation. It has
apparently been generally assumed that when a Π state is perturbed or
predissociated by a Σ state only one of the two components of the Π state due to Λ
doubling is affected. Actually this is invariably true only for Π states in Hund's
case b. In Hund's case a the rotational quantum number J is the determining
factor in the rotational energy of the Π state and in the selection rules, and K is
not rigorously a good quantum number. The B ³Π_g state of N₂ is intermediate
between Hund's cases a and b but approximates sufficiently to the former to lead us to
expect that the quantum number J will dominate K in the choice of selection rules.

The lowest dissociation products for N₂ are ⁴S + ⁴S, and from these atomic states
a molecular ⁵Σ⁺ state can arise. It is reasonable to suppose that this state is
repulsive. Each rotational level (in so far as it is possible to define a rotation of a
non-stable molecule) is defined by a quantum number K, and each level K consists
of five spin components J = K−2, K−1, K, K+1 and K+2. The + or − symmetry
of all five spin components of any K level is the same. (Symmetry + or −
refers to reflexion in the origin of the nuclear and electronic coordinates, inclusive
of spin. For a proof of the statement see Wigner & Witmer (1928).)

In order to interpret van der Ziel's predissociation, we suppose that near the
crossing of the potential curves of B ³Π_g and ⁵Σ⁺ it is possible for a range of J values




in the stable Π state to change spontaneously to the same J value in the Σ state.
Any small adjustment in the rotational energy is easily accomplished by variations
in the kinetic energy of the nuclei along the inter-nuclear direction, since this
motion is not quantized in an unstable state. Each J value in the Σ state occurs
twice with one symmetry (+ or −) and thrice with the opposite symmetry (− or
+). Hence both components of the Λ doublets of the Π state can predissociate.
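The two-and-three count can be verified by enumeration. The sketch assumes that the symmetry of a level of a Σ⁺ state alternates with K, taken here as + for even K; the overall sign convention is an assumption and does not affect the 3 : 2 split. The function name is illustrative.

```python
from collections import Counter


def symmetry_counts(j, spin=2):
    """Count the + and - levels of total angular momentum j in a Sigma+ state.

    For spin S each K level has components J = K - S ... K + S, so a given J
    is found in the levels K = |J - S| ... J + S.  All spin components of one
    K level share the symmetry of that K.
    """
    counts = Counter()
    for k in range(abs(j - spin), j + spin + 1):
        counts["+" if k % 2 == 0 else "-"] += 1
    return counts


quintet_j5 = symmetry_counts(5)   # K = 3, 4, 5, 6, 7
quintet_j6 = symmetry_counts(6)   # K = 4, 5, 6, 7, 8
```

For every J ≥ 2 the quintet state offers levels of both symmetries, which is why both Λ-doublet components of the Π state find a partner of the same J and the same symmetry.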

It is perhaps worth pointing out that if the Σ state had been stable, the mutual
perturbations of the Σ and Π states would be very much more limited, because the
condition of equal energy of levels with the same symmetry and J value will be
satisfied only at one point in each series of levels. However, for CN, Wager (1943)
has observed perturbations in both Λ components of a ²Π state, in Hund's case a,
resulting from interaction with a ²Σ state. Levels of the a ²Π state, with
vibrational quantum number v = 9, are perturbed by levels of the b ²Σ state with v = 0.
The levels of ²Π_{3/2} with J = 3½ and 7½ and of ²Π_{1/2} with J = 10½ and 15½ interact
with levels of the ²Σ state with K = 4, 7, 11 and 15 respectively. Detailed
examination of this series of perturbations shows that both components of the Π state
(due to Λ doubling) are affected, but at different energies.

With this interpretation of van der Ziel's predissociation in N₂, and making
use of the definite dissociation limit at 12.145 eV, we obtain for the dissociation
energy a value of 9.764 eV. The values of 7.383 and 8.573 eV are not ruled out by this
interpretation of van der Ziel's predissociation, but, as already stated, the 7.383 eV
value leads to a violation of the non-crossing rule by the a ³Σ_u⁺ and a ¹Π_u states
of N₂, while the 8.573 eV value implies a violation of the rule for several states of NO
(see later). No violation of the non-crossing rule is indicated by the value of 9.764 eV,
which also agrees better with the values obtained by the Birge-Sponer
extrapolation for the x ¹Σ_g⁺, a ¹Π_u, a ³Σ_u⁺ and B ³Π_g states of N₂.
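The three candidate values follow from assigning the 12.145 eV limit to different pairs of atomic states. In the sketch below the excitation energies of N(²D) and N(²P) above N(⁴S) are not quoted in the paper; they are back-derived here from the paper's own three figures and should be read as assumptions, as should the names used.

```python
PREDISSOCIATION_LIMIT = 12.145  # eV, limit from the Second Positive system

# Excitation energies above N(4S) in eV.  Assumed values, chosen so as to
# reproduce the three dissociation energies quoted in the text.
EXCITATION = {"4S": 0.0, "2D": 2.381, "2P": 3.572}


def candidate_dissociation_energies(limit, excitation):
    """Return {(state1, state2): D} for each pair of atomic product states."""
    names = list(excitation)
    result = {}
    for i, first in enumerate(names):
        for second in names[i:]:
            result[(first, second)] = limit - excitation[first] - excitation[second]
    return result


candidates = candidate_dissociation_energies(PREDISSOCIATION_LIMIT, EXCITATION)
d_preferred = candidates[("4S", "2D")]   # products 4S + 2D
d_alternative = candidates[("4S", "2P")] # products 4S + 2P
d_low = candidates[("2D", "2D")]         # products 2D + 2D
```

On these assumptions the products ⁴S + ²D, ⁴S + ²P and ²D + ²D give 9.764, 8.573 and 7.383 eV respectively; the spectroscopic limit alone cannot choose between them, and the choice rests on the non-crossing argument.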

It should be noted that Hagstrum & Tate (1941) have made a determination of
the dissociation energy by electron collision experiments using a mass
spectrometer. They found that ionized nitrogen atoms appeared at 24.3 eV. Taking the
ionization potential of atomic nitrogen as 14.55 eV and using the relation

A(N⁺) = I(N) + D(N₂)

we obtain directly D(N₂) = 9.75 ± 0.2 eV, in good agreement with the above value
of 9.764 eV. In order to bring their results into line with the spectroscopic value of
7.383 eV, Hagstrum & Tate assumed that the products of dissociation were excited
atoms, but this now seems unnecessary.

The dissociation energy of N₂⁺

The dissociation energy of ionized nitrogen is linked with that of the neutral
molecule by the cycle

D(N₂) + I(N) = I(N₂) + D(N₂⁺).

Using the latest values for I(N) and I(N₂) as 14.55 and 15.58 eV we obtain

D(N₂) − D(N₂⁺) = 1.03 eV.
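As a check on the cycle, the difference of the two dissociation energies is simply the difference of the two ionization potentials. The sketch uses the values quoted in the text, together with D(N₂) = 9.764 eV from the preceding section; the variable names are illustrative.

```python
I_N = 14.55    # ionization potential of atomic nitrogen, eV
I_N2 = 15.58   # ionization potential of molecular nitrogen, eV
D_N2 = 9.764   # dissociation energy of N2 adopted in the preceding section, eV

# D(N2) + I(N) = I(N2) + D(N2+)  =>  D(N2) - D(N2+) = I(N2) - I(N)
difference = I_N2 - I_N
d_n2_plus = D_N2 - difference
```

This gives a difference of 1.03 eV and hence a dissociation energy of about 8.73 eV for the ion.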




The First Negative bands of N₂⁺ correspond to a ²Σ⁺ → ²Σ⁺ transition, and
extrapolation of the vibrational levels indicates that both states probably have the
same dissociation limit, which is presumably the lowest, N(⁴S) + N⁺(³P).
Extrapolation of the vibrational levels of the lower state is of little value as the
extrapolation to the limit is long. For the upper level, the situation is more favourable.
Herzberg (1928) observed definite vibrational levels up to v′ = 11 and, using data
for band heads, obtained D(N₂⁺) around 6.9 eV. Since then Parker (1933) has made
a rotational analysis of the (13, 15) First Negative band. Making use of this



Figure 3. Extrapolation of the vibrational levels of the upper, ²Σ⁺, state of the N₂⁺ First
Negative bands to the dissociation limit.

× × × Points obtained from Herzberg's measurements (origins).

+ + Points based on Parker's value for the origin of the (13, 15) band; this leads to a
reliable value for G₁₃ and, since G₁₁ is known, to the sum ΔG_{11½} + ΔG_{12½}; either of these points
may be raised provided the other is lowered by an equal amount.

○ This point is based on the observation by Herzberg of the (14, 10) band at 3217.7 Å; this
is a weak band and this point may not be reliable.


additional data, and using values (calculated with the aid of the rotational
constants) for band origins instead of heads (which differ significantly because some
bands are degraded to the red and some to the violet and in some cases the heads
and origins are far apart) we have repeated the extrapolation. The result is shown
in figure 3, which is a curve between ΔG_{v+½} and the total vibrational energy term G_v.
The values of D(N₂) to which the extrapolation would lead are indicated along the
G_v axis. It will be seen that the curve, like that of AgH, shows a definite inflexion
leading to a higher dissociation energy and that, while a value for D(N₂) as high
as 9.7 eV would have appeared unlikely from extrapolation of the lower vibrational
energy levels, with the new data for higher levels this value does appear possible.




It will be noted that a smooth extrapolation to D(N₂) = 7.38 eV would be quite
impossible, and indeed the v′ = 14 level itself lies slightly above this energy.

We thus see that evidence obtained from the spectrum of ionized nitrogen does
not contradict the new higher value for the dissociation energy of N₂. The value
for N₂⁺ is probably 9.764 − 1.03 = 8.73 eV.

The dissociation energy of nitric oxide 

The dissociation energy of nitric oxide is linked with that for nitrogen and
oxygen by thermochemical data (see Bichowsky & Rossini 1936)

2NO = N₂ + O₂ + 1.87 eV.

Taking D(O₂) = 5.082 eV and D(N₂) = 9.764 eV this leads to D(NO) = 6.49 eV, compared
with the lower value of 5.29 eV given by Herzberg (1939).
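The step from the thermochemical equation to D(NO) may be written out explicitly: breaking two NO molecules into atoms costs 2D(NO), and recombining the atoms as N₂ and O₂ returns D(N₂) + D(O₂), the balance being the 1.87 eV evolved. A short sketch with the values quoted in the text (the function name is illustrative):

```python
def d_nitric_oxide(d_n2, d_o2, heat_evolved=1.87):
    """D(NO) in eV from 2NO = N2 + O2 + heat_evolved.

    Energy balance through free atoms:
        D(N2) + D(O2) - 2 D(NO) = heat_evolved
    """
    return 0.5 * (d_n2 + d_o2 - heat_evolved)


D_O2 = 5.082
d_no_new = d_nitric_oxide(9.764, D_O2)   # with the higher D(N2)
d_no_old = d_nitric_oxide(7.383, D_O2)   # with van der Ziel's D(N2)
```

The first call gives about 6.49 eV and the second about 5.30 eV, close to Herzberg's 5.29 eV, so the whole change in D(NO) is inherited from the change in D(N₂).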

One of us (Gaydon 1944b) has recently studied the several band systems emitted
by NO and has drawn up potential energy curves for all the known electronic
states. It has been shown that it is extremely difficult to reconcile a low value
of D(NO) with the non-crossing rule. The new value lies just above all the known
energy levels for the β, γ and δ systems, and although it is necessary to assume that
the potential energy curves of the upper electronic states are largely determined by
higher dissociation products, the non-crossing rule is not actually violated, and
no energy levels are known above the expected dissociation limits for the respective
states.


The dissociation energy of cyanogen 

Using certain thermochemical data, including a somewhat low value for the
dissociation of C₂N₂ into 2CN, Herzberg (1939) obtained D(CN) = 5.96 eV. Schmid,
Gerö & Zemplén (1938) have followed vibrational levels for the a ²Π state of CN
up to v = 30 at an energy of about 51,000 cm⁻¹ (6.3 eV) by observing perturbations
in the violet system. The extrapolated limit for a ²Π obtained by Schmid, Gerö &
Zemplén is 60,500 ± 1,000 cm⁻¹ (7.5 ± 0.12 eV). Now on the non-crossing rule a ²Π
should dissociate to the lowest possible products, C(³P) + N(⁴S), so that if
D(CN) = 5.96 eV we are again faced with a violation of the non-crossing rule.

However, if we use the new values D(CO) = 11.11 eV, D(N₂) = 9.76 eV and take (see
data summarized by Bichowsky & Rossini 1936)

C₂N₂ + 2O₂ = 2CO₂ + N₂ + 260 kcal,

2CO + O₂ = 2CO₂ + 134 kcal,

and use White's (1940) spectroscopically obtained value for the dissociation of
C₂N₂ into 2CN as 146 ± 4 kcal, we obtain

D(CN) = 7.6 ± 0.2 eV,

which is in satisfactory agreement with the direct spectroscopic limit obtained by
Schmid, Gerö & Zemplén.
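The thermochemical cycle for D(CN) can be followed step by step. The sketch below works in kcal and converts at the end; the factor of 23.06 kcal per eV is a modern rounded value and an assumption of this sketch, D(O₂) = 5.082 eV is the value used for nitric oxide above, and the names are illustrative.

```python
KCAL_PER_EV = 23.06  # kcal per mole per eV; modern rounded value (assumption)


def d_cyanogen_radical(d_co, d_n2, d_o2, q_c2n2=260.0, q_co=134.0, d_c2n2=146.0):
    """D(CN) in kcal from the two combustion heats and D(C2N2 -> 2CN).

    Subtracting the CO combustion from the C2N2 combustion gives
        C2N2 + O2 = 2CO + N2 + (q_c2n2 - q_co).
    Passing through free atoms, the heat evolved is
        2 D(CO) + D(N2) - D(O2) - D(C2N2 -> 2CN) - 2 D(CN).
    """
    q_net = q_c2n2 - q_co
    return 0.5 * (2.0 * d_co + d_n2 - d_o2 - d_c2n2 - q_net)


d_cn_kcal = d_cyanogen_radical(
    d_co=11.11 * KCAL_PER_EV,
    d_n2=9.764 * KCAL_PER_EV,
    d_o2=5.082 * KCAL_PER_EV,
)
d_cn_ev = d_cn_kcal / KCAL_PER_EV
```

The result is about 174 kcal, or a little over 7.5 eV, which lies within the stated uncertainty of the value in the text and close to the spectroscopic limit of 7.5 eV.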





Conclusion 

The only molecules which have hitherto been thought to violate the non-crossing
rule (excepting the doubtful case of CaH) are CO, CN, NO and N₂, which are just
those molecules for which the Birge-Sponer extrapolation was in serious
disagreement with the dissociation energies obtained from predissociation data. It has been
shown that by interpreting the effect in the Fourth Positive bands of CO as a
perturbation instead of as a predissociation and by using the alternative
explanation for van der Ziel's predissociation in the nitrogen spectrum, violation of the
non-crossing rule can be satisfactorily avoided for all the above molecules.

We are therefore of the opinion that the non-crossing rule is a reliable guide in the
correlation of atomic and molecular energy levels. The most probable values for
the dissociation energies appear to be

D(CO) = 11.11 eV = 256.1 kcal,

D(N₂) = 9.764 eV = 225.1 kcal,

D(NO) = 6.49 eV = 149.6 kcal,

D(CN) = 7.5 eV = 174 kcal,

C₂N₂ = 2CN − 6.3 eV (= 146 kcal).

Latent heat of sublimation of carbon 170.1 kcal.

One of us (A. G. G.) wishes to acknowledge generous financial assistance from the
Warren Research Fund of the Royal Society.


References 

Anand, B. M. 1942 Science and Culture, 8, 278.
Bengtsson, E. & Olsson, E. 1931 Z. Phys. 72, 163.
Bichowsky, F. R. & Rossini, F. D. 1936 The thermochemistry of the chemical substances. New York: Reinhold.
Birge, R. T. 1929 Trans. Faraday Soc. 25, 707.
Birge, R. T. & Sponer, H. 1926 Phys. Rev. 28, 209.
Biskamp, H. 1933 Z. Phys. 86, 33.
Brons, F. 1935 Physica, Eindhoven, 2, 1108.
Coster, D. & Brons, F. 1934 Physica, Eindhoven, 1, 155.
Coster, D. & Brons, F. 1935 Amst. Proc. K. Akad. 38, 961.
Gaydon, A. G. 1944a Nature, 153, 407.
Gaydon, A. G. 1944b Proc. Phys. Soc. 56, 160.
Gaydon, A. G. & Pearse, R. W. B. 1939 Proc. Roy. Soc. A, 173, 37.
Gaydon, A. G. & Penney, W. G. 1942 Nature, 150, 406.
Gerö, L. 1936 Z. Phys. 99, 62; 100, 374.
Gerö, L. 1938 Z. Phys. 109, 210.
Gerö, L. & Szabó, F. 1939 Ann. Phys., Lpz., 35, 597.
Grundström, B. 1932 Z. Phys. 75, 302.
Hagstrum, H. D. & Tate, J. T. 1941 Phys. Rev. 59, 354.
Herzberg, G. 1928 Ann. Phys., Lpz., 86, 189.
Herzberg, G. 1937 Chem. Rev. 20, 145.
Herzberg, G. 1939 Molecular spectra and molecular structure. New York: Prentice-Hall.
Herzberg, G. 1942 J. Chem. Phys. 10, 306.
Kynch, G. J. & Penney, W. G. 1941 Proc. Roy. Soc. A, 179, 214.
Lessheim, H. & Samuel, R. 1934 Proc. Phys. Soc. 46, 523.
Loomis, F. W. & Nusbaum, R. E. 1932 Phys. Rev. 38, 1447.
Mulliken, R. S. 1937 J. Phys. Chem. 41, 5.
v. Neumann, J. & Wigner, E. 1929 Phys. Z. 30, 467.
Parker, A. E. 1933 Phys. Rev. 44, 914.
Schmid, R. 1936 Z. Phys. 99, 274.
Schmid, R. & Gerö, L. 1935 Z. Phys. 96, 546.
Schmid, R. & Gerö, L. 1936 Z. Phys. 99, 281.
Schmid, R. & Gerö, L. 1937 Z. Phys. 105, 36.
Schmid, R., Gerö, L. & Zemplén, J. 1938 Proc. Phys. Soc. 50, 283.
Wager, A. T. 1943 Phys. Rev. 64, 18.
Watson, W. W. & Koontz, P. G. 1934 Phys. Rev. 46, 32.
White, J. U. 1940 J. Chem. Phys. 8, 459.
Wigner, E. & Witmer, E. E. 1928 Z. Phys. 51, 859.
Ziel, A. van der 1934 Physica, Eindhoven, 1, 353.
Ziel, A. van der 1937 Physica, Eindhoven, 4, 373.


The structure of polyisoprenes 
I. The crystal structure of geranylamine hydrochloride 

By G. A. Jeffrey 

(Communicated by W. N. Haworth, F.R.S. Received 25 May 1944)

An X-ray analysis employing three-dimensional Fourier syntheses has established the crystal
structure and molecular dimensions of the di-isoprene derivative, geranylamine
hydrochloride. The molecules, which have a trans configuration and are therefore analogous to
gutta-percha, lie parallel and end to end in pairs within an ionic framework where each nitrogen
atom is equidistant from four chlorine neighbours. The two isoprene units are planar and have
normal interatomic distances, but are linked by a C-C bond markedly shorter than a normal
single bond. This unusual bond feature is accompanied by a coplanar arrangement with the
adjacent carbon bonds.


Introduction 

The difficulty of crystal structure determination of complex organic molecules is 
to some degree offset by the extent of the diffraction data. With the polyisoprenes 
much of this information is obscured both by the diffuse nature of the fibre diagrams 
and by ambiguities arising from imperfect crystallite orientation. The resulting 
lack of precision is well illustrated by the example of rubber, where attempts at 
direct analysis have led to considerable disagreement even in the preliminary 
description of the unit cell (Lotmar & Meyer 1936; Sauter 1937; Morss 1938; Misch
& Van der Wyk 1940). The detailed structures for rubber and β-gutta-percha
recently proposed by Bunn (1942) are subject to the same limitations and cannot be
regarded as necessarily final solutions, particularly as they entail surprisingly large
valency distortions of a type hitherto unparalleled.

With the object of determining indirectly the interatomic dimensions in these
polymers attention has been directed to simpler analogues. Following the study of
a mono-isoprene compound by Cox & Jeffrey (1942), derivatives of geraniol were
considered since they possess a di-isoprene system akin to the repeating units in the
long-chain polymers. Although many of these compounds are unsuitable for X-ray
methods, the hydrochloride of geranylamine (1) afforded the requisite crystals for
intensity measurements, and their favourable space group together with the
presence of a comparatively heavy chlorine atom has made possible the complete
and accurate analysis of the structure:


CH3-C(CH3)=CH-CH2-CH2-C(CH3)=CH-CH2-NH2.HCl        (1)


Experimental 

Four molecules of C10H17NH2.HCl are associated by the space-group symmetry
P2₁/c in a unit cell of dimensions a = 22.68 Å, b = 5.94 Å, c = 8.98 Å, β = 98.8°.
A complete description of the structure excluding the hydrogen atoms therefore
requires the determination of 36 atomic parameters (three co-ordinates for each of
the twelve atoms Cl, N and C1 to C10). For this purpose, all plane
reflexions within the range of Cu Kα radiation were recorded on oscillation
photographs about the principal axes, and their relative intensities were estimated by eye
with the aid of calibration charts. As it is important that these values should be
absolute, they were related to selected intensities transmitted through a thin crystal
slab and measured by photometric comparison with a standard rock-salt (400)
reflexion recorded on the same film. All photographs were taken three or more
times with different exposures, using aluminium screens of known absorption to
bring strong spots within the range of linear X-ray density.

From the experimental measurements the structure amplitudes of 1060 planes
were calculated by the usual formulae, and were used first in the summation of a
three-dimensional F² (Patterson) Fourier synthesis.


The arrangement of the chlorine atoms 

The positions of the chlorine atoms were derived directly from the Patterson
synthesis by virtue of the outstanding Cl-Cl vector peaks, two at co-ordinates
(0.107, 0.50, 0.162) and (0.107, 0.50, 0.662) and a third of double height at (0.00,
0.00, 0.50). These peaks must arise from atoms at either (0.053, 0.25, 0.081) or
(0.053, 0.25, 0.331) in conjunction with atoms symmetrically related. The former was
ultimately shown to be correct, but both possibilities had to be considered since no
distinction can be made between the centre of symmetry and the screw axis for
atoms on the glide planes.
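
The ambiguity described above can be checked numerically. The following is a minimal sketch, assuming the standard general positions of P2₁/c (unique axis b); the function names are illustrative and are not part of the original analysis. It lists the interatomic (Patterson) vectors generated by a single chlorine site and shows that the two candidate sites give the same set of peaks, with the peak at (0, 0, 0.5) carrying twice the weight of the others. The x co-ordinate of the vector comes out as 0.106 rather than 0.107 because the site co-ordinate is quoted to three decimals only.

```python
from itertools import permutations


def p21c_equivalents(x, y, z):
    """General equivalent positions of P2_1/c (unique axis b), assumed standard setting."""
    return [
        (x, y, z),
        (-x, -y, -z),
        (-x, y + 0.5, 0.5 - z),
        (x, 0.5 - y, z + 0.5),
    ]


def wrap(value):
    """Reduce a fractional co-ordinate to [0, 1) and round to three decimals."""
    return round(value % 1.0, 3) % 1.0


def patterson_vectors(site):
    """Count the vectors between all ordered pairs of symmetry-related atoms."""
    atoms = p21c_equivalents(*site)
    counts = {}
    for p, q in permutations(atoms, 2):
        vec = tuple(wrap(b - a) for a, b in zip(p, q))
        counts[vec] = counts.get(vec, 0) + 1
    return counts


if __name__ == "__main__":
    for candidate in [(0.053, 0.25, 0.081), (0.053, 0.25, 0.331)]:
        print("site", candidate)
        for vec, weight in sorted(patterson_vectors(candidate).items()):
            print("   vector", vec, "weight", weight)
```

The remaining vectors in the output, at x = 0.894, are the centrosymmetric partners of the two peaks quoted in the text.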


The Patterson synthesis, having symmetry planes at y = 0 and ½, cannot reveal
any small departure in the y parameter from 0.250. This was assumed to be exact
for the approximate structure, and during the successive refinements no evidence
to the contrary was observed. Further confirmation was available at the conclusion
of the analysis from the comparison of the agreement between observed and
calculated structure amplitudes for planes with l odd, where the chlorines make zero
contribution, and those with l even:

\[ \frac{\sum \bigl|\,|F_{\mathrm{obs.}}| - |F_{\mathrm{calc.}}|\,\bigr|}{\sum |F_{\mathrm{obs.}}|} = 17.0 \text{ and } 17.8\,\% \text{ respectively.} \]

As shown in figures 4 and 5 these halogen atoms lie in sheets close to the (100)
planes enclosing the organic molecules in pairs between them.
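
The statement that the chlorines contribute nothing to planes with l odd follows from the y parameter being exactly 0.25. A minimal sketch, assuming the standard general positions of P2₁/c and a unit scattering factor (both assumptions of this illustration, not values from the paper), shows the cancellation:

```python
import cmath
import math


def p21c_equivalents(x, y, z):
    """General equivalent positions of P2_1/c (unique axis b), assumed standard setting."""
    return [
        (x, y, z),
        (-x, -y, -z),
        (-x, y + 0.5, 0.5 - z),
        (x, 0.5 - y, z + 0.5),
    ]


def structure_factor(site, h, k, l, f=1.0):
    """Geometrical structure factor of one atomic site and its symmetry equivalents.

    f is a placeholder scattering factor; the real value varies with angle.
    """
    total = 0j
    for x, y, z in p21c_equivalents(*site):
        total += f * cmath.exp(2j * math.pi * (h * x + k * y + l * z))
    return total


if __name__ == "__main__":
    chlorine = (0.053, 0.25, 0.080)  # chlorine co-ordinates as listed in table 2
    for hkl in [(2, 0, 0), (1, 0, 2), (3, 2, 1), (5, 1, 3)]:
        print(hkl, round(abs(structure_factor(chlorine, *hkl)), 6))
```

With y = 0.25 the atoms at (x, y, z) and (x, ½ − y, z + ½) differ only by a shift of ½ in z, so their contributions cancel whenever l is odd; the same holds for the other pair.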


Determination of the approximate structure

The general orientation of the molecules was apparent from the length of the 
a axis; and from the weak odd layer lines on the c axis photographs it was clear that
other atoms besides the halogens must lie on or near the glide planes. Further 
information was derived from the Patterson synthesis which was computed over 
sections parallel to (010) at intervals of y sufficiently close to reveal all prominent 
peaks. Six such peaks of height corresponding to Cl-N and Cl-C distances indicated 
positions for N, C1 and C2 in which the nitrogen was equidistant from four chlorine
atoms and the interatomic distances and valency angles were approximately normal. 
This was the only arrangement compatible with the vector maps. Beyond this stage, 
however, these became much more complicated and further direct interpretation 
was not possible. For the remaining eight atoms trial and error methods were 
adopted and the Patterson synthesis was useful only in restricting the number of 
possibilities to be considered. 

A model based on normal interatomic dimensions was used, first to ascertain 
arrangements which would satisfy the principal structure factors of the (h0l) zone,
and then for further trials with general planes. Owing to the wide range of free 
rotation about each single bond, a large number of configurations was tried and 
rejected before arriving at a set of co-ordinates which, when refined by Fourier F 
synthesis, resulted in a marked improvement in the agreement between the observed 
and calculated structure factors. These and all subsequent syntheses were of the 
three-dimensional type, since projections were of little value even on the most 
favourable (010) plane. 
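
The agreement between observed and calculated structure factors referred to here can be expressed by the residual already quoted for the chlorine check, Σ||F obs.| − |F calc.|| / Σ|F obs.|. The sketch below shows the arithmetic only; the function name and the four pairs of values are illustrative and are not taken from the paper's tables.

```python
def residual(f_obs, f_calc):
    """Return sum(| |Fo| - |Fc| |) / sum(|Fo|) over paired structure factors."""
    if len(f_obs) != len(f_calc):
        raise ValueError("observed and calculated lists must have equal length")
    numerator = sum(abs(abs(o) - abs(c)) for o, c in zip(f_obs, f_calc))
    denominator = sum(abs(o) for o in f_obs)
    return numerator / denominator


if __name__ == "__main__":
    # Illustrative values only: observed amplitudes carry no sign,
    # calculated values carry the sign given by the trial structure.
    observed = [10, 20, 30, 40]
    calculated = [12, -18, 33, -40]
    print("residual = %.1f %%" % (100 * residual(observed, calculated)))
```

A trial structure is improved when a change of co-ordinates lowers this figure over the planes considered.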


The Fourier F synthesis

Starting from the approximate structure, the atomic co-ordinates were refined
by the method of successive Fourier syntheses. The summations were evaluated
from all the available data using Beevers and Lipson's strips over sections and along
lines at values of x, y and z appropriate to the maximum of each atom peak. After
the fifth series, the structure-factor calculations revealed no further sign changes
and the electron maps from this final summation are shown in figures 1 and 2. For
purposes of illustration, the electron density peaks for one molecule have been
taken from the sectional summations and superimposed on the b and c planes
respectively. The contours are at intervals of one electron per Å³, with the exception of
the chlorine atom, for which the scale is two electrons. The zero and first contours have
been omitted for clarity in the diagrams. These maps show clearly that the peaks
differ for carbon atoms at opposite ends of the molecule and, although approximately
the same in total volume, they become progressively flatter for each atom along
the chain from the nitrogen. While it cannot be certain that this has a real physical
significance and does not arise merely from the use of a finite Fourier series, the
effect suggests that it may be attributed to a variation in thermal
motion between the polar and non-polar ends of the molecule.


The final atomic parameters 

The atomic parameters were measured directly from the maximum of each peak in
the Fourier synthesis and gave the interatomic dimensions shown in table 1.


Table 1

Bond          Length (Å)            Bond          Length (Å)
N-C1          1.49   (1.48)         C5-C6         1.44   (1.45)
C1-C2         1.54   (1.53)         C6-C7         1.51   (1.52)
C2-C3         1.31   (1.32)         C7-C8         1.31   (1.32)
C3-C4         1.53   (1.53)         C8-C9         1.53   (1.55)
C3-C5         1.51   (1.54)         C8-C10        1.54   (1.53)

Angle                               Angle
N-C1-C2       109°   (109°)         C3-C5-C6      115°   (112°)
C1-C2-C3      126°   (126°)         C5-C6-C7      112°   (112°)
C2-C3-C4      124°   (124°)         C6-C7-C8      129°   (129°)
C2-C3-C5      121°   (119°)         C7-C8-C9      124°   (123°)
C4-C3-C5      115°   (117°)         C7-C8-C10     121°   (121°)
                                    C9-C8-C10     115°   (116°)


[...] and in place of the complete table of 1060 values, which was submitted
with the paper but not published, this agreement is shown diagrammatically in
figure 3 for some 500 values.

[...] the F values used previously contained a greater
number of observed intensities for the l even series of planes than for the l odd by
reason of the [...]
F values (c. 350) where the difference between F observed and F chlorine was less
than 1.5 per molecule, this value being the minimum observable when F chlorine
was zero. The new atomic positions showed a mean shift of 0.02 Å from those
previously derived. The corresponding interatomic distances are given in
parentheses in table 1.

As an alternative method of balancing the l even and l odd terms, the F values
were calculated for all planes of spacing greater than 0.9 Å whose intensities were too
small to be observed. These amounted to some 300 values which when included in
the synthesis also resulted in shifts in the parameters of the order of 0.02 Å and, as
might be expected, considerably sharpened the electron peaks. The double bond
lengths became 1.34 Å, the C5-C6 bond 1.46 Å, and the other bonds remained the
same within 0.01 Å. The progressive broadening of the contour intervals for each
atom along the molecule was still clearly apparent, although the additional terms
increased the maximum electron density of the peaks other than those at y = ¼ and
¾ by a fraction of an electron per Å³.

Both these Fourier syntheses were an improvement on the original as regards
the shape of the electron contours, particularly those of C5 where the elongation of
the peak in the c axis direction was completely removed. The changes in peak
positions were of the same order as those between the penultimate and final
successive Fourier syntheses, and the bond lengths and valency angles showed no
variations beyond the limits of experimental errors which are estimated at ±0.04 Å
and ±4°.

Although the atom peaks were least sharp in the synthesis using the smaller 
number of F values, these atomic parameters (table 2) are considered to be the most 
reliable since they were derived solely from experimental data with the exclusion, 
for the organic part of the molecule, of such values as might impose a systematic 
error on the positions of the maxima. 


Table 2

Atom    x        y        z            Atom    x        y        z
Cl      0.053    0.250    0.080        C5      0.253    0.625    0.182
N       0.049    0.750    0.216        C6      0.304    0.700    0.287
C1      0.103    0.750    0.333        C7      0.357    0.746    0.207
C2      0.158    0.751    0.255        C8      0.394    0.916    0.212
C3      0.198    0.593    0.264        C9      0.447    0.918    0.118
C4      0.198    0.379    0.358        C10     0.392    1.120    0.315
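
The step from the fractional co-ordinates of table 2 to the bond lengths of table 1 can be illustrated as follows. This is a sketch under stated assumptions: the co-ordinates are first referred to orthogonal axes in the usual way for a monoclinic cell (X along a, Y along b, Z perpendicular to both), and the function names are illustrative. It is not a reproduction of the original computation, and agreement with table 1 can only be expected to within the rounding of the co-ordinates to three decimals.

```python
import math

# Unit-cell dimensions quoted in the Experimental section (Å, degrees).
A, B, C = 22.68, 5.94, 8.98
BETA = math.radians(98.8)

# Fractional co-ordinates as read from table 2.
COORDS = {
    "Cl": (0.053, 0.250, 0.080), "N": (0.049, 0.750, 0.216),
    "C1": (0.103, 0.750, 0.333), "C2": (0.158, 0.751, 0.255),
    "C3": (0.198, 0.593, 0.264), "C4": (0.198, 0.379, 0.358),
    "C5": (0.253, 0.625, 0.182), "C6": (0.304, 0.700, 0.287),
    "C7": (0.357, 0.746, 0.207), "C8": (0.394, 0.916, 0.212),
    "C9": (0.447, 0.918, 0.118), "C10": (0.392, 1.120, 0.315),
}


def to_cartesian(frac):
    """Convert monoclinic fractional co-ordinates to orthogonal Å co-ordinates."""
    x, y, z = frac
    return (A * x + C * z * math.cos(BETA), B * y, C * z * math.sin(BETA))


def bond_length(atom1, atom2):
    """Distance in Å between two atoms of the same molecule."""
    return math.dist(to_cartesian(COORDS[atom1]), to_cartesian(COORDS[atom2]))


if __name__ == "__main__":
    bonds = [("N", "C1"), ("C1", "C2"), ("C2", "C3"), ("C3", "C4"), ("C3", "C5"),
             ("C5", "C6"), ("C6", "C7"), ("C7", "C8"), ("C8", "C9"), ("C8", "C10")]
    for a1, a2 in bonds:
        print(f"{a1}-{a2}: {bond_length(a1, a2):.2f} Å")
```

The same routine applied to atoms in neighbouring molecules would first require the symmetry operations of the space group to be applied to one of the two positions.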


The stereochemistry of the molecule

Within experimental error the two isoprene units are identical, the parameters of
C1-C5 being more certain than C6-C10 owing to the flattening of the Fourier peaks
described above. They possess normal bond lengths (C=C 1.34 Å, C-C 1.54 Å)
and the disposition of valencies about the double bonds is planar. Their angles
appear to be distorted by about 5° from the theoretical 120° with the result that
C1-C2 is not exactly parallel to C3-C5, nor C6-C7 to C8-C9. This variation, although
small and very near the margin of error, may be real and characteristic of the
unsymmetrical arrangement of the groups about the double bond.

The bond C5-C6 linking the isoprene units is shorter than that associated with a
normal C-C bond by an amount two or three times the probable experimental error.
Furthermore the distribution of the adjoining C3-C5 and C6-C7 is planar. These
features, reminiscent of those found in conjugated molecules, seem inexplicable in
terms of the usual theory of mesomerism. As reported by Bateman & Jeffrey (1943),
the amine hydrochloride group does not appear to be directly responsible and the
hybrid character may result from a hyperconjugation process in which the
α-methylenic C-H electrons become partially localized in the central bond:

           H   H
           |   |
  -CH=C----C===C----CH=C-
      |    |   |       |
      CH3  H   H       CH3

Evidence as to what extent this can be regarded as a general property of the
polyisoprene system must await the examination of compounds with a similar structural
modification.

Other stereochemical features of importance in the geranylamine structure
concern the angular position of the C5-C6 bond relative to the two isoprene planes.
Free rotation about the bond C3-C5 will be influenced by repulsion of the CH2 group
by the CH and CH3. If the CH2 group lies in the plane of the isoprene (C1 ... C5), then
the approach between one or other pair of these atoms would be of the order of
2.8 Å; a distance which in the absence of any special bonding between the hydrogen
atoms provides an appreciable energy barrier. In the observed structure the C5-C6
bond is oriented at 80° to the isoprene plane and the steric repulsions balance at
almost equal distances (C6-C2 3.28 Å, C6-C4 3.24 Å). In contrast, since there is no
methyl addendum to C7, the rotation about C6-C7 relative to the other isoprene
plane is influenced solely by the approach of the CH2 to the CH3. Over a large
angular range, this repulsion will be ineffectual and the factors determining the
observed angle of 45° between the C5-C6 bond and the C6 ... C10 plane must arise
mainly from the packing of adjacent molecules in the crystal lattice, and probably
from the relation between the terminal C(CH3)2 groups in particular. Whereas this
latter steric effect is specific to the particular intermolecular arrangement, the
former depends upon intramolecular forces and, since it will apply to any molecule
with the same configuration, has an important bearing on the stereochemistry of
the long-chain polymers.

The intermolecular arrangement

The packing of the molecules in the crystal lattice is illustrated in figures 4 and 5.
The hydrocarbon chains lie parallel with the molecules end to end in pairs. Each 
nitrogen is associated with four chlorine atoms at nearly equal distances. 






Figure 4. The structure of geranylamine hydrochloride. 



Figure 5. Projection of the structure on (010). 




All other distances between atoms in adjacent molecules conform to van der
Waals forces and are greater than 3.75 Å with the exception of 3.60 Å between
centrosymmetrically related C9 atoms.

Table 3

N-Cl1    3.24 Å        N-Cl3    3.24 Å
N-Cl2    3.17          N-Cl4    3.24

Acknowledgements 

I am grateful to Professor Sir Lawrence Bragg and Dr E. G. Cox for advice and 
criticism and to Professor W. N. Haworth for facilities granted at Birmingham 
University for the earlier part of the work. Thanks are also given to Dr D. A. Sutton 
for preparation of the geranylamine hydrochloride. 

The above work has been carried out as part of a programme of fundamental 
research undertaken by the Board of the British Rubber Producers’ Research 
Association. 


References

Bateman & Jeffrey 1943 Nature, 152, 446.

Beevers & Lipson 1936 Proc. Phys. Soc. 48, 772.

Bunn 1942 Proc. Roy. Soc. A, 180, 40.

Cox & Jeffrey 1942 Trans. Faraday Soc. 38, 241.

Lotmar & Meyer 1936 Mh. Chem. 69, 115.

Misch & Van der Wyk 1940 J. Chem. Phys. 8, 127.

Morss 1938 J. Amer. Chem. Soc. 60, 273.

Sauter 1937 Z. Phys. Chem. 36 B, 405.


For the tables of calculated and observed structure factors see pages 398-404. 



398 


G. A. Jeffrey 


hhl 

100 

200 

300 

400 

500 

600 

700 

800 

900 

1000 

1100 

1200 

1300 

1400 

1500 

1600 

1700 

1800 

1900 

2000 

2100 

2200 

2300 

2400 

002 

102 

202 

302 

402 

502 

602 

702 

802 

902 

1002 

1102 

1202 

1302 

1402 

1502 

1602 

1702 

1802 

1902 

2002 

2102 

2202 

2302 

2402 

2502 


& calc. 
48 
17 
30 
— 25 
45 
-46 
-55 
-73 
-71 
-41 
-32 
-42 
-17 
-18 
9 

21 

14 

23 

22 

41 

10 

0 

3 

-8 

-111 

19 

62 

-47 

-18 

-108 

-49 

-44 

-42 

-33 

-15 

28 

14 

26 

29 

27 

23 

21 

13 

1 

-13 

2 

3 

-10 

-5 

-14 


Calculated and observed structure factors 


-^obs. 

hid 

48 

■ T02 

16 

202 

41 

502 

33 

402 

50 

502 

56 

502 

76 

702 

82 

802 

74 

502 

53 

1002 

25 

IT02 

47 

1502 

16 

1502 

18 

1402 

<8 

1502 

32 

1502 

15 

1702 

24 

1502 

27 

1502 

47 

5502 

10 

5102 

<8 

5502 

<8 

5502 

7 

130 

10 

5102 

5502 

89 

004 

54 

104 

27 

204 

109 

304 

66 

1 404 

34 

504 

47 

604 

39 

704 

<8 

804 

38 

904 

32 

1004 

31 

1104 

34 

1204 

34 

1304 

21 

1404 

21 

1504 

9 

1604 

<8 

1704 

12 

1804 

<8 

1904 

<8 

2004 

<8 

2104 

10 

2204 

11 

2304 



hhl 

T04 

104 

204 

404 

504 

204 

704 

204 

904 

1204 

1104 

1204 

T204 

1404 

1504 

T204 

1704 

1204 

1904 

2004 

2104 

2204 

2304 

2404 

2504 

006 
106 
206 
306 
406 
506 
606 
706 
806 
906 . 
1006 
1106 
1206 
1306 
1406 
1506 
1606 

T06 

206 

206 

406 

506 

206 

706 


F <calc. 

5 
9 

29 

18 

39 

42 

43 
24 

-38 

6 
4 

-12 

-14 

-13 

-25 

-24 

-14 

-17 

-1 

-1 

6 

8 

10 

7 

6 

-27 


-18 

-12 

-4 

-8 

12 

22 

17 
23 

18 
12 

5 

5 

-1 

-1 

-23 

-18 

-5 

0 

3 

20 

12 


-^obs. 

<8 

13 

25 

16 

38 

42 

45 

6 

54 

6 

<8 

13 

13 
12 
18 
19 
12 

14 
<8 

5 

7 

10 

10 

9 

8 

23 

27 

16 

18 

12 

10 

13 
12 
18 

14 
18 
16 

9 

<8 

<8 

<8 

<8 

14 

14 
<8 
<8 
<8 

15 
12 



The structure of polyisoprenes 


Calculated and observed structure factors 


hkl 

F calc. 

Fobs. 

506 

34 

39 

506 

42 

48 

1506 

26 

22 

TT06 

23 

21 

1206 

15 

10 

1506 

5 

6 

1106 

-22 

22 

1506 

-7 

14 

1506 

-11 

17 

1706 

-14 

16 

1506 

-15 

15 

1506 

-17 

16 

2506 

-8 

8 

2106 

-5 

<8 

2206 

-2 

<8 

2306 

2 

<8 

2106 

0 

<8 

008 

-12 

6 

108 

3 

5 

208 

-1 

<8 

308 

4 

5 

408 

10 

8 

508 

17 

14 

608 

24 

16 

708 

9 , 

8 

808 

4 

12 

908 

13 

12 

1008 

2 

<8 

1108 

6 

12 

108 

-16 

9 

208 

-17 

12 

208 

-37 

28 

108 

-16 

18 

508 

-10 

<8 

508 

-7 

<8 

708 

3 

<8 

508 

-4 

<8 

508 

9 

<8 

1508 

10 

<8 

1108 

16 

14 

1208 

6 

<8 

1508 

10 

10 

1108 

18 

15 

1508 

8 

8 

1508 

2 

<8 

1708 

1 

<8 

1508 

-1 

<8 

110 

60 

75 

210 

-26 

23 

310 

-20 

12 


hkl 

F calc. 

F obs. 

410 

-21 

18 

510 

-56 

39 

610 

-65 

62 

710 

-26 

25 

810 

-20 

9 

910 

7 

14 

1010 

6 

11 

1110 

10 

11 

1210 

33 

33 

1310 

31 

33 

1410 

40 

43 

1510 

18 

15 

1610 

6 

<8 

1710 

11 

18 

1810 

5 

7 

1910 

-15 

16 

2010 

1 

<8 

2110 

0 

<8 

2210 

-12 

12 

2310 

-8 

8 

2410 

-8 

8 

Oil 

18 

21 

111 

15 

22 

211 

49 

50 

311 

-80 

80 

411 

8 

<8 

511 

10 

15 

611 

11 

18 

711 

16 

16 

811 

-33 

41 

911 

1 

<8 

1011 

4 

7 

1111 

4 

<8 

1211 

4 

<8 

1311 

-3 

<8 

1411 

-7 

7 

1511 

2 

<8 

1611 

0 

<8 

1711 

10 

10 

1811 

-4 

<8 

1911 

-8 

7 

Til 

-31 

29 

211 

-44 

46 

311 

51 

58 

ill 

-3 

<8 

Ill 

5 

•K, 8 

Ill 

-4 

5 

711 

-22 

20 

Bll 

0 

<8 

Ill 

17 

21 

mi 

-3 

<8 


hkl 

mi 

im 

mi 

im 

T511 

mi 

1711 

mi 

012 
112 
212 
312 
412' 
512 
612 
712 
812 
912 
1012 
1112 
1212 
1312 
1412 
1512 
1612 
1712 
1812 
1912 
2012 
2112 
2212 
2312 
2412 
2512 

T12 

212 

212 

112 

212 

212 

712 

212 

212 

TU12 

TT12 

T212, 

1312 

1112 

1312 

1212 

1712 


F calc. 

4 

-20 

2 

11 

-1 

3 

15 

7 

-18 

-146 

-79 

-59 

-47 

14 

-17 

-17 

11 

14 

15 
40 
29 
20 
11 

16 
-7 

-10 
-15 
i -24 
-7 
-8 
-9 
-4 
-3 
4* 

7 

5 
17 
51 
70 
49 
55 
62 
37 
11 

6 
3 

-13 

-15 

-16 

-33 

-30 


399 


F obs. 
<8 
22 
<8 
12 
<8 
<8 
14 
8 

13 

140 

84 

60 

43 

20 

17 

20 

<8 

9 

<8 

37 

29 

19 
11 
15 

9 

* <8 
<8 
14 
8 
9 

<8 

<8 

<8 

<8 

8 

<8 

25 

64 

76 

55 

59 

62 

32 
11 

5 

12 

18 

17 

20 

33 
33 



400 


hkl 

1812 

1212 

2012 

2112 

2212 

2212 

2¥l2 

2512 

013 

113 

213 

313 

413 

513 

613 

713 

813 

913 

1013 

Tl3 

213 

213 

113 
213 
213 
713 
213 

213 
1013 
TT13 
T213 
T213 
1413 
1513 

1213 
1713 

014 

114 

214 
814 
414 
514 
614 
714 
814 
914 

1014 

1114 

1214 
1314 


G. A. Jeffrey 


Calculated and observed structure factors 



hkl 

1414 

1514 

1614 

1714 

1814 

1914 


Tl4 

214 

214 

414 

214 

614 

714 

814 

214 

1214 

TT14 

1214 

1214 

1414 

1514 

1214 

1714 

1214 

1214 

2214 

2T14 

2214 

2214 


015 

115 

215 

315 

415 

515 

615 

715 

815 


115 

215 

215 

415 

515 

215 

715 

815 

215 

1215 



hkl 

^calc. 

016 

2 

116 

-1 

216 

18 

316 

30 

416 

27 

516 

29 

616 

8 

716 

3 

816 

14 

916 

3 

1016 

— 6 

1116 

— 7 

1216 

— 5 

1316 

— 13 

1416 

i ~13 

1516 

' -11 

1616 

-12 


He 

-6 

216 

-14 

516 

-17 

il6 

-40 

516 

-28 

516 

-22 

716 

-8 

516 ■ 

-10 

516 

-8 

1516 

-5 

TT16 

10 

1216 

13 

T5I6 

10 

1116 

16 

1516 

20 

1516 

14 

1716 

9 

1816 

4 

1516 

-2 

2516 

-4 

2116 

-3 

2216 

-5 

2516 

-5 

2116 

-11 


018 

8 

118 

11 

218 

10 

318 

13 

418 

9 

518 

6 

618 

8 

718 

6 


F obs. 


<8 

<8 

15 

24 

25 
21 

<8 

<8 

<8 

<8 

6 

6 

6 

9 , 

10 
11 
8 


<8 

<8 

17 
31 
26 

18 
<8 
<8 
<8 
<8 
<8 

8 

8 

16 

17 

10 

8 

<8 

<8 

<8 

<8 

<8 

7 

8 


8 

9 

8 

7 

7 

7 

6 



hkl 

Tl8 

218 

518 

418 

518 

518 

718 

518 

518 

1518 

TT18 

1518 

T518 

T418 

1518 

1518 

T718 

1818 

020 

120 

220 

320 

420 

520 
620 

720 
820 

920 
1020 
1120 
1220 
1320 
1420 
1520 
1620 
1720 
1820 
1920 

021 

121 

221 

321 

421 

521 
621 

721 
821 

921 
1021 
1121 
1221 


The structure of polyisoprenes 401 


Calculated and observed structure factors 


& calc. 

F <obs. 

hkl 

F calc. 

-^obs. 

hkl 

F calc. 

F obs. 

8 

<8 

1321 

-2 

<8 

522 

— 45 

45 

5 

<8 

1421 

8 

8 

622 

-30 

32 

-9 

<8 

1521 

-2 

<8 

722 

-14 

15 

2 

<8 

1621 

-2 

<8 

822 

-8 

<8 

— 9 

<8 

1721 

2 

<8 

922 

8 

<8 

-10 

8 

1821 

7 

<8 

1022 

13 

12 

-16 

14 

1921 

-1 

<8 

TT22 

33 

30 

-24 

17 

2021 

-4 

<8 

TS22 

32 

32 

-12 

9 

2121 

2 

<8 

1322 

34 

35 

-3 

<8 




H22 

34 

32 

-5 

<8 

T21 

-4 

<8 

1622 

0 

<8 

2 

<8 

521 

33 

35 

1622 

7 

<8 

8 

9 

521 

-34 

37 

1722 

1 

<8 

9 

9 

421 

-8 

<8 

1822 

-6 

<8 

9 

10 

521 

4 

8 

T522 

-2 

<8 

7 

' <8 

521 

12 

13 

2022 

-6 

6 

10 

9 

721 

4 

8 

2T22 

-10 

9 

2 

<8 

521 

-13 

13 

2222 

-10 

8 



521 

5 

<8 

2522 

-7 . 

9 

-110 

103 

1521 

-2 

<8 




-92 

97 

TT21 

0 

<8 

023 

19 

17 

-40 

43 

T521 

0 

<8 

123 

3 

<8 

-47 

34 

1521 

1 

<8 

223 

8 

<8 

8 

8 

1421 

-2 

<8 

323 

-31 

25 

33 

32 

1521 

1 

<8 

423 

10 

8 

4 

10 

T521 

-8 

11 

623 

9 

13 

36 

34 




623 

4 

10 

39 

39 

022 

11 

15 

723 

2 

<8 

47 

49 

122 

-7 

11 

823 

-16 

11 

47 

50 

222 

0 

<8 

923 

11 

11 

32 

32 

322 

21 

25 




20 

20 

422 

24 

21 

T23 

-2 

<8 

12 

8 

522 

24 

* 30 

223 

-32 

29 

0 

<8 

622 

70 

64 

523 

10 

<8 

8 

8 

722 

44 

46 

123 

8 

8 

-9 

9 

822 

29 

27 

623 

0 

<8 

-25 

26 

922 

24 

16 

623 

0 

<8 

-19 

24 

1022 

-5 

9 

723 

-5 

<8 

-28 

30 

1122 

-6 

12 

523 

-3 

<8 



1222 

-12 

16 

623 

1 

<8 

-15 

17 

1322 

-20 

21 

1023 

0 

<8 

-10 

14 

1422 

-23 

25 

TT23 

14 

15 

5 

<8 

1522 

-29 

29 

1523 

-15 

13 

8 

<8 

1622 

-22 

20 

T¥23 

-4 

<8 

-13 

8 

1722 

-10 

7 

Ti23 

5 

<8 

4 


1822 

-8 

<8 

1523 

1 

<8 

-6 

9 

1922 

0 

<8 

1623 

8 

8 

-4 

8 

2022 

5 

<8 

1723 

-13 

12 

-3 

<8 







3 

<8 

T22 

-16 

7 

024 

2 

<8 

-4 

9 

522 

-36 

27 

124 

27 

18 


<8 

522 

-35 

37 

224 

33 

29 

6 

<8 

122 

-63 

61 

324 

29 

25 



m 


G. A. Jeffrey 

Calculated and observed structure factors 


hkl 

, F ' calc . 


424 

32 

25 

524 

24 

16 

624 

1 

<8 

724 

1 

<8 

824 

7 

<8 

924 

-10 

<8 

1024 

1 

<8 

1124 

-9 

8 

1224 

-23 

13 

1324 

-17 

11 

1424 

-15 

9 

1524 

-14 

8 

1624 

-4 

<8 

1724 

-4 

<8 

1824 

0 

<8 

1924 

2 

<8 


124 

1 

<8 

224 

-15 

16 

324 

-21 

22 

124 

-28 

20 

524 

-32 

26 

324 

-29 

24 

724 

-31 

26 

524 

-19 

10 

924 

0 

<8 

1324 

7 

9 

TT 24 

-5 

<8 

1524 

15 

15 

1324 

4 

<8 

1124 

9 

<8 

1524 

27 

21 

1324 

18 

15 

1724 

15 

12 

1524 

10 

8 

1324 

3 

<8 

5324 

3 

<8 

5 T 24 

-3 

<8 

5524 

-7 

9 

5324 

-7 

8 

5124 

-9 

9 

3324 

-8 

8 


025 

-7 

7 

125 

0 

<8 

225 

-4 

<8 

325 

12 

8 

425 

-5 

<8 

525 

-12 

8 

625 

2 

<8 

725 

5 

6 

825 

12 

8 


hkl 

F calc . 

- Fobs . 

T 25 

-8 

8 

525 

-4 

<8 

325 

7 

<8 

125 

-4 

<8 

325 

0 

<8 

325 

-16 

9 

725 

5 

<8 

525 

7 

<8 

525 

3 

<8 

1325 

— 2 

<8 

TT 25 

-18 

9 

1525 

-14 

11 

026 

24 

19 

126 

26 

20 

226 

21 

17 

326 

21 

14 

426 

10 

10 

526 

7 

7 

626 

4 

<8 

726 

-7 

<8 

826 

-20 , 

14 

926 

-22 

17 

1026 

-23 

21 

1126 

-21 

17 

1226 

2 

<8 

1326 

-7 

<8 

126 

19 

17 

526 

14 

12 

326 

4 

<8 

126 

2 

<8 

526 

-6 

6 

626 

-14 

‘ 13 

726 

-19 

16 

526 

-20 

24 

526 

-38 

32 

1026 

-28 

28 

1126 

-16 

17 

1526 

-16 

8 

1326 

5 

8 

1426 

8 

11 

1326 

8 

9 

1326 

10 

9 

1726 

13 

12 

1526 

12 

13 

1526 

12 

12 

5326 

6 

7 

028 

10 

8 

128 

2 

.<8 

228 

-2 

<8 

328 

-6 

6 

428 

-9 

7 

528 

-12 

9 


hkl    F calc.    F obs.

628 -12 12 

728 -11 9 

T 28 12 % 8 

528 18 13 

528 21 15 

128 18 13 

528 9 8 

528 8 7 

728 -1 <8 

528 -10 17 

528 6 8 

1528 -6 <8 

TT 28 -8 8 

1528 -8 8 

*1528 -13 11 

1428 -10 9 

T 528 -6 <8 

T 528 -4 <8 

T 728 -2 <8 

T 828 -4 <8 

130 1 <8 

230 11 8 

330 18 10 

430 15 15 

530 45 37 

630 30 21 

730 17 11 

830 10 8 

930 4 <8 

1030 -4 <8 

1130 -16 15 

1230 -18 18 

1330 -21 18 

1430 -25 23 

1530 -25 20 

1630 - 2 <8 

031 -3 <8 

131 -14 13 

231 -3 <8 

331 13 9 

531 -4 <8 

431 -3 <8 

631 -2 <8 

731 -4 <8 

831 9 11 

931 ' 2 <8 

1031 -4 <8 

1131 0 <8 

1231 -1 <8 

T 31 8 <8 

531 5 <8 



The structure of polyisoprenes 403 


Calculated and observed structure factors 


Ml 

F calc . 

& obs . 

Ml 

F calc . 

obs . 

Ml 

F calc . 

F obs . 

331 

14 

12 

2532 

10 

9 

T 36 

8 

8 

131 

— 3 

<6 

5132 

0 

<8 

236 

11 

9 

53 JL 

-4 

<6 




536 

15 

13 

531 

-2 

<6 

034 

17 

17 

436 

21 

13 

731 

5 

7 

134 

25 

20 

536 

22 

11 

831 

# 6 

5 

234 

21 

13 

536 

19 

12 

331 

— 4 

<8 

334 

0 

<8 

736 

5 

8 

1531 

— 3 

<8 

434 ' 

-7 

7 

536 

1 

<8 

1131 

0 

<8 

534 

-14 

7 

536 

1 

<8 

1231 

7 

<8 

• 634 

-4 

<8 

1536 

-5 

<8 

1331 

2 

<8 

734 

-38 

29 

TT 36 

4 

<8 

1131 

-2 

<8 

834 

-34 

34 

1236 

-5 

<8 

1531 

-2 

<8 

934 

-29 

29 

1536 

-10 

8 




1034 

-28 

16 

1436 

-16 

10 

032 

54 

48 

1134 

2 

<8 

1536 

-15 

10 

132 

38 

31 

1234 

-6 

<8 

T 536 

-5 

<8 

232 

39 

31 

1334 

-3 

<8 




332 

38 

32 

1434 

4 

<8 

038 

-3 

<8 

432 

36 

34 

1534 

6 

<8 

138 

-4 

<8 

532 

10 

10 

1634 

13 

11 

T 38 

-2 

<8 

632 

— 9 

8 

1734 

15 

19 

238 

2 

<8 

732 

9 

8 

1834 

8 

6 

, 538 

-2 

<8 

832 

-11 

8 




¥38 

4 

<8 

932 

-11 

8 

T 34 

33 

28 

538 

8 

8 

1032 

-17 

16 

534 

30 

28 

538 

10 

8 

1132 

-26 

23 

534 

30 

35 

738 

3 

<8 

1232 

-25 

21 

¥34 

27 

30 

538 

12 

8 

1332 

-18 

17 

534 

11 

13 

538 

13 

10 

1432 

-12 

8 

534 

1 

<8 

1538 

5 

10 

1632 

-2 

<8 

734 

-6 

7 

TT 38 

3 

8 

1632 

-7 

<8 

534 

-16 

14 




1732 

-4 

<8 

534 

-8 

14 

040 

29 

* 23 

1832 

7 

<8 

T 534 

-39 

39 

140 

39 

30 

1932 

6 

<8 

IT 34 

-36 

42 

240 

31 

27 




1234 

-29 

30 

340 

22 

15 

132 

9 

15 

T 534 

-26 

20 

440 

-4 

<8 

232 

7 

<8 

I ¥34 

- 9 

<8 

540 

19 

14 

532 

-24 

31 

1534 

4 

<8 

640 

-1 

<8 

132 

-31 

39 

1534 

-2 

<8 

740 

-17 

9 

532 

-13 

15 

1734 

7 

9 

840 

-20 

16 

532 

-37 

34 

T 534 

8 

9 

940 

-22 

17 

732 

-38 

42 

1534 

13 

13 

1040 

-33 

34 

532 

-37 

39 

. 2534 

9 

9 

1140 

-21 

22 

532 

-26 

31 

2 T 34 

7 

<8 

1240 

-13 

18 

1532 

-10 

8 

V* 



1340 

-6 

8 

1132 

-4 

<8 

036 

1 

<8 




1232 

-4 

<8 

136 

-6 

6 

042 

9 

8 

1532 

11 

8 

236 

-16 

11 

142 

-2 

<8 

1132 

-13 

12 

336 

-18 

13 

242 

-6 

10 

1532 

16 

8 

436 

-16 

14 

342 

-13 

9 

1532 

28 

26 

536 

-20 

12 

442 

-9 

8 

1732 

16 

14 

636 

-15 

<8 

542 

-10 

13 

1532 

21 

23 

736 

-5 

<8 

642 

-44 

36 

1532 

10 

19 

836 

-3 

<8 , 

742 ** 

-28 

28 



404 G. A. Jeffrey 


Calculated and observed structure factors 


hkl 

F icalc . 

-^ obs . 

hkl 

F icalc . 

& obs . 

hkl 

& calc . 

-^ obs . 

842 

—16 

14 

041 

5 

<8 

854 

16 

14 

942 

-10 

8 

141 

-5 

<8 

954 

14 

12 

1042 

3 

<8 

241 

0 

<8 

1054 

10 

20 

1142 

-3 

<8 

341 

9 

9 




1242 

6 

<8 

441 

-6 

<8 

154 

*15 

12 

T 42 

14 


541 

-1 

<8 

254 

-18 

16 

9 

141 

-8 

<8 

§54 

-16 

15 

142 

18 

14 

241 

4 

<8 

454 

-11 

9 

142 

23 

20 

§41 

0 

<8 

554 

— 5 

<8 

142 

27 

24 

441 

2 

<8 

§54 

-6 

<8 

142 

642 

742 

842 

26 

18 

10 

3 

24 

16 

8 

<8 

150 
• 250 

4 

-10 

<8 

8 

754 

§54 

§54 

4 

4 

8 

<8 

<8 

<8 

§42 

-10 

8 

350 

-7 

8 




1042 

-8 

8 

450 

-20 

14 

060 

-22 

20 

TT 42 

-13 

10 

550 

-23 

14 

160 

-11 

14 

1242 

-19 

13 

650 

-13 

9 

260 

-11 

10 


750 

-9 

8 

360 

-5 

6 

044 

-2 

<8 

850 

-7 

8 

460 

3 

<8 

144 

-14 

9 

950 

-4 

<8 

560 

-3 

<8 

244 

-23 

14 

1050 

1 

<8 

660 

4 

<8 

344 

-17 

16 

1150 

8 

<8 

760 

8 

9 

444 

-22 

14 




860 

11 

11 

544 

~ l T 

8 

052 

-18 

18 

960 

10 

13 

644 

2 

<8 

152 

-28 

23 

1060 

9 

11 

744 

/ -1 

<8 

252 

-18 

19 




844 

-4 

<8 

352 

-20 

15 

062 

4 

<8 

944 

. 2 

<8 

452 

-14 

11 

162 

-1 

<8 

1044 

0 

<8 

552 

-3 

<8 

262 

-4 

<8 

144 

-2 

o 

<8 

rj 

652 

752 

-4 

1 

<8 

<8 

362 

462 

6 

0 

<8 

<8 

5244 

544 

444 

o 

7 

18 

1 

9 

15 

852 

3 

<8 

562 

662 

13 

13 

9 

9 

544 

25 

16 

152 

. —8 

<8 

762 

8 

8 

344 

16 

14 

252 

-2 

<8 




744 

13 

8 

§52 

8 

8 

T 62 

-7 

<8 

344 

6 

8 

452 

7 

8 

262 

-7 

<8 

944 

6 

8 

§52 

8 

9 

§62 

-13 

10 

1344 

-2 

<8 

§52 

20 

15 

462 

-14 

11 

TT 44 

-4 

<8 

752 

18 

17 

§62 

-7 

- 7 

T 244 

-7 

8 

§52 

16 

16 





§52 

11 

11 

064 

-1 

<8 

046 

-8 

12 

T §52 

2 

<8 

164 

2 

<8 

146 

-13 

11 

1152 

6 

<8 « 

264 

12 

8 

246 

-10 

8 




364 

4 

<8 

T 46 

-13 

11 

054 

-11 

8 

464 

5 

<8 

546 

-9 

9 

154 

-8 

8 

164 

-2 

<8 

346 

-9 

7 

254 

-9 

8 

264 

-3 

<8 

146 

0 

<8 

354 

-5 

<8 

§64 

-3 

<8 

546 

7 

<8 

454 

4 

<8 

164 

-1 

<8 

346 

5 

<8 

554 

1 

<8 

564 

-11 

8 

746 

8 

14 

654 

8 

<8 

§64 

-7 

6 

546 

10 

** 16 - 

754 

15 

10 

764 

-4 

<8 



The statistical theory of the strength of bundles 
of threads. I 

By H. E. Daniels, Ph.D., Wool Industries Research Association
(Communicated by H. Jeffreys, F.R.S. Received 31 May 1944)

A group of parallel threads of equal length, clamped at each end so that all threads extend
equally under tension, is called a bundle, and the maximum load which the bundle can
support is called its strength. The object of the work is to study the probability distribution of
the strength of bundles whose constituent threads are sampled randomly from an infinite
population of threads in which the probability distribution of strength is known.

The relation between the strength of a bundle and the strengths of its constituent threads
is first discussed, and results are stated for bundles so large that the proportions of threads
of different strengths approach their expectations. The properties of the probability
distribution of bundle strength are next developed in detail, attention being confined in the
present paper to the case where all threads have the same load-extension curve up to breaking
point. Finally, the asymptotic behaviour of the distribution for large numbers of threads
is studied, and it is shown that in the commonest cases the distribution tends to assume
the normal form.


Introduction 

1. Consider an infinite population of threads of equal length whose probability
distribution of strength is known. A number of threads are sampled randomly from
this population, laid side by side and clamped at each end to form what we term
a bundle. When a free load is applied to the bundle all threads extend by an equal
amount, and the minimum load beyond which all the threads of the bundle break is
called its strength. The present work is an investigation into the probability
distribution of the strength of such bundles.

The problem presents itself naturally in the theory of strength testing of textile
materials. For example, a test widely used in practice for estimating the strength of
yarns is known as the ‘hank’ test for wool yarns, and the ‘lea’ test for cotton yarns.
The method employed is to reel a hank of yarn containing a stated number of turns
of specified circumference and to apply the breaking load to the hank by stretching
it between two hooks. The word ‘bundle’ has been used here in preference to ‘hank’
or ‘lea’, since there is usually a small amount of slip at the hooks which makes the
conditions of this test indeterminate. The testing of cloth samples for warp and weft
strength is another instance of a practical test in which the specimens approximate
to bundles, though in closely woven fabrics the cross-threads afford a measure of
support which introduces complications. Other examples outside the field of textile
testing could no doubt also be quoted.

2. The subject was first considered, as far as the author is aware, by F. T. Peirce
(1926), relevant experimental work being published in previous papers in the same
series. He deals exhaustively with the underlying physical considerations and derives
useful formulae for the strength of large bundles.



The wider significance of the problem was also recognized by Peirce. He points
out that a study of the strength properties of certain materials must involve
considerations fundamentally similar to those arising in the theory of bundles (called
by Peirce ‘composite specimens’), since each element of the material may be thought
of as made up of subelements arranged both in series and parallel along a particular
direction of stress. Recently a notable attempt has been made by W. Weibull (1939)
to evolve a statistical theory of strength of materials, in the course of which
consideration is given to the behaviour under a free load of materials composed of
independent parallel elements (the case of so-called ‘incoherent irregularity’).*

The present paper opens with a preliminary section on the relation between the 
strength of a bundle and the strengths of its constituent threads, in which the work 
overlaps that of Peirce to some extent. The distribution which forms the main 
topic of the paper was not, however, discussed by Peirce. Apart from its practical 
application, it is of considerable inherent interest and appears hitherto to have 
received little attention from mathematicians. 

The relation between a bundle and its constituent threads 

3. It is assumed that for each thread there is a definite extension and load at 
which it breaks, so that threads which yield rather than snap are excluded from the 
discussion. Whatever assumptions are made about the elastic properties of the 
threads, the following sequence of events occurs when a given load is applied. 
Suppose the load is increased gradually from zero to its final value. At first it is 
distributed in some way between the n threads, and equilibrium may be reached 
without any thread giving way. If the load is great enough, however, one of the 
threads breaks at some stage and the load is redistributed among the remaining 
$n - 1$ threads, each bearing a somewhat larger share of the load than before and so
being more likely to give way. Similarly, a number of threads may break successively 
until either a point is reached where the remaining threads have sufficient strength 
to maintain between them the final load, or no such point is reached and all the 
threads ultimately give way, in which case, of course, the bundle is broken. 

To formulate the problem more precisely, it is necessary to know how the load 
distributes itself between the individual threads of the bundle, and this depends on
the elastic properties of the threads. It should be observed that if they are assumed
to be inextensible the problem has no definite solution. A familiar example of the 
same state of affairs is that of a rigid beam supported horizontally at more than two 
points, in which case the pressures on the supports cannot be deduced. 

* Unfortunately there appears to be a flaw in his discussion of probabilities, and formulae
are obtained which are open to question. I suggest that his equations (136) and (137) should
read
$$dS_{0/2} = 2[S(2\sigma) - S(\sigma)]\,dS(\sigma)$$
and
$$dS_{1/2} = 2S(\sigma)\,dS(2\sigma)$$
respectively. The final expression for $S_{2/2} = S_{0/2} + S_{1/2}$ then agrees with the present
formula (9·1) for $n = 2$.




Denoting load by $s$ and extension by $e$, the load-extension relation for a given
thread can be written
$$s = f(e, \alpha),$$
where $\alpha$ represents, for simplicity, one parameter defining the curve specific to a
particular thread. The value of $\alpha$ is in the general case correlated with the breaking
extension $\epsilon$, or the breaking load $\sigma = f(\epsilon, \alpha)$.


Large bundles 

4. It is convenient first to consider very large bundles within which the actual
numbers of threads with given properties can be treated as equal to their
expectations. (See, however, the remark at the end of § 7.)

The probability of $\epsilon$ and $\alpha$ lying in specified small intervals is denoted by
$$\phi(\epsilon, \alpha)\,d\alpha\,d\epsilon \tag{4·1}$$
and the probability density for $\epsilon$ for each thread is
$$\psi(\epsilon) = \int_0^\infty \phi(\epsilon, \alpha)\,d\alpha. \tag{4·2}$$
Let a load $S$ be applied to the bundle, and let the corresponding extension be $e$.
If the bundle does not break, a state of equilibrium is set up in which there are $r$
surviving threads out of the original $n$, where $r$ satisfies the relation
$$\frac{r}{n} = \int_e^\infty \psi(\epsilon)\,d\epsilon = \int_e^\infty d\epsilon \int_0^\infty \phi(\epsilon, \alpha)\,d\alpha. \tag{4·3}$$
At extension $e$ the load on a particular thread is $f(e, \alpha)$, and when equilibrium is
established, the total load is distributed over the surviving threads so as to make
$$S = r\,\frac{\int_e^\infty d\epsilon \int_0^\infty f(e, \alpha)\,\phi(\epsilon, \alpha)\,d\alpha}{\int_e^\infty d\epsilon \int_0^\infty \phi(\epsilon, \alpha)\,d\alpha}. \tag{4·4}$$
Hence from (4·3) and (4·4) the load-extension relation for the bundle is
$$S = n \int_e^\infty d\epsilon \int_0^\infty f(e, \alpha)\,\phi(\epsilon, \alpha)\,d\alpha. \tag{4·5}$$
Breaking extension $e_T$ is attained when $S$ has its greatest possible value $S_T$, and
$e_T$ is therefore the root of the equation
$$\frac{d}{de_T} \int_{e_T}^\infty d\epsilon \int_0^\infty f(e_T, \alpha)\,\phi(\epsilon, \alpha)\,d\alpha = 0, \tag{4·6}$$
which gives $S$ its greatest value.

It is worth noting from (4·5) that the load-extension curve is similar for all large
bundles, the load for a given extension being directly proportional to the original
number of constituent threads.




5. We now introduce certain restrictions on the behaviour of the threads which
lead to simplifications in the theory.

(i) In many practical cases the load-extension curves for all threads are similar
in form, differing only in a scale factor. This might be expected, for example, when
the parameter $\alpha$ is the thickness of the thread. Then
$$f(e, \alpha) = \alpha f(e) \tag{5·1}$$
and (4·5) becomes
$$S = n f(e) \int_e^\infty \bar\alpha(\epsilon)\,\psi(\epsilon)\,d\epsilon, \tag{5·2}$$
where $\bar\alpha(\epsilon)$ is the expectation of $\alpha$ given that the thread has breaking extension $\epsilon$.

The function $f(e)$ may reasonably be taken to increase steadily with $e$, and no
generality is then sacrificed if Hooke’s law is assumed, since otherwise we may always
transform to a new ‘extension’ $f(e)$. The load-extension curve is therefore taken to be
$$f(e, \alpha) = \alpha e \tag{5·3}$$
and (5·2) becomes
$$S = n e \int_e^\infty \bar\alpha(\epsilon)\,\psi(\epsilon)\,d\epsilon. \tag{5·4}$$



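(ii) It is found experimentally (cf. Peirce 1926) that the probabilities of $\alpha$ and $\epsilon$
are often distributed independently, in which case $\phi(\epsilon, \alpha)$ factorizes into $\psi(\epsilon)$ and a
density in $\alpha$ alone, and $\bar\alpha(\epsilon) = \bar\alpha$, independent of $\epsilon$. Then (5·4) becomes
$$S = n \bar\alpha e \int_e^\infty \psi(\epsilon)\,d\epsilon \tag{5·5}$$
and the breaking extension of the bundle satisfies
$$\frac{d}{de_T} \left\{ e_T \int_{e_T}^\infty \psi(\epsilon)\,d\epsilon \right\} = 0. \tag{5·6}$$
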


(iii) Finally, the dispersion of $\alpha$ may be negligibly small compared with that of
breaking extension $\epsilon$. That this is a reasonable assumption in certain circumstances
is evident on considering a thread composed of a large number $N$ of independent
consecutive elements, such as, for example, a long chain, the $r$th element obeying
Hooke’s law with $\alpha = \alpha_r$. Under a load $s$ each element extends by $e_r = s/\alpha_r$ and the
total extension is
$$\frac{1}{N} \sum_{r=1}^{N} e_r = \frac{s}{N} \sum_{r=1}^{N} \frac{1}{\alpha_r}.$$
The load-extension ratio for the whole thread is thus $\alpha$, where
$$\frac{1}{\alpha} = \frac{1}{N} \sum_{r=1}^{N} \frac{1}{\alpha_r},$$
and it follows that as the number $N$ of elements is increased the standard deviation
of $\alpha$ decreases as $N^{-1/2}$.

But if $\beta(s)$ is the chance of an element succumbing under load $s$, the chance of the
whole thread giving way is
$$b(s) = 1 - [1 - \beta(s)]^N.$$
When $N$ is large, and $\beta(s) \sim \kappa s^\nu$ ($\nu > 0$) near $s = 0$, the limiting form of $b(s)$ is
$$b(s) \sim 1 - e^{-N\kappa s^\nu}. \tag{5·7}$$
Hence the expectation of $s$ and the standard deviation of $s$ are both proportional
to $(N\kappa)^{-1/\nu}$ and so have a ratio independent of $N$.
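
The statement that the ratio is independent of $N$ can be checked numerically. The following Python sketch is an added illustration, not part of the original argument; it assumes the limiting form (5·7) holds exactly and uses the standard gamma-function expressions for the moments of that distribution. The function names and the parameter values in the usage note are hypothetical.

```python
import math

def thread_strength_moments(N, kappa, nu):
    """Mean and standard deviation of s when P(strength <= s) = 1 - exp(-N*kappa*s**nu).

    Assumes the limiting form (5.7) is exact; the scale factor is (N*kappa)**(-1/nu).
    """
    scale = (N * kappa) ** (-1.0 / nu)
    g1 = math.gamma(1.0 + 1.0 / nu)
    g2 = math.gamma(1.0 + 2.0 / nu)
    mean = scale * g1
    sd = scale * math.sqrt(g2 - g1 * g1)
    return mean, sd

def coefficient_of_variation(N, kappa, nu):
    """Ratio of standard deviation to mean, which should not depend on N."""
    mean, sd = thread_strength_moments(N, kappa, nu)
    return sd / mean
```

For instance, `coefficient_of_variation(10, 2.0, 3.0)` and `coefficient_of_variation(1000, 2.0, 3.0)` agree to rounding error, while the mean itself shrinks in proportion to $N^{-1/\nu}$.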

We have reduced the problem to the simplest case when all threads have the same
load-extension ratio and each thread bears an equal share of the load. Writing
$\theta(\sigma)$ for the probability density of breaking load $\sigma$, the relation between the total
load $S$ and the load $s$ on each surviving thread at equilibrium is
$$S = ns \int_s^\infty \theta(\sigma)\,d\sigma \tag{5·8}$$
and the breaking load $S_T$ of the bundle occurs when $s = s_T$ satisfies
$$\frac{d}{ds_T} \left\{ s_T \int_{s_T}^\infty \theta(\sigma)\,d\sigma \right\} = 0 \tag{5·9}$$
and gives $S$ its greatest value $S_T$.
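
As an added illustration (not in the original paper), the large-bundle breaking load per thread can be located by a crude grid search for the maximum of $s$ times the chance that a thread survives load $s$. The two thread-strength distributions below, uniform on (0, 1) and exponential with unit mean, are hypothetical examples chosen only because their maxima are known in closed form.

```python
import math

def large_bundle_strength_per_thread(cdf, s_max, steps=100_000):
    """Grid search for the s maximising s * (1 - cdf(s)) on (0, s_max].

    cdf(s) is the chance that a thread's breaking load does not exceed s.
    Returns (s_T, S_T / n) in the large-bundle approximation.
    """
    best_s, best_val = 0.0, 0.0
    for i in range(1, steps + 1):
        s = s_max * i / steps
        val = s * (1.0 - cdf(s))
        if val > best_val:
            best_s, best_val = s, val
    return best_s, best_val

def uniform_cdf(s):
    """Hypothetical example: strengths uniform on (0, 1)."""
    return min(max(s, 0.0), 1.0)

def exponential_cdf(s):
    """Hypothetical example: strengths exponential with unit mean."""
    return 1.0 - math.exp(-s)
```

With the uniform example the search returns approximately $s_T = 1/2$ and $S_T/n = 1/4$; with the exponential example, approximately $s_T = 1$ and $S_T/n = 1/e$.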

6. A very useful way of representing these results graphically is now introduced.
The notation
$$\int_0^s \theta(\sigma)\,d\sigma = b(s) = b(1/w) = a(w) \tag{6·1}$$
is used for the chance of a thread not exceeding $s$ in strength, where $w$ may be called
the ‘weakness’ of the thread. Then (5·8) and (5·9) become
$$S = \frac{n[1 - a(w)]}{w} \tag{6·2}$$
and
$$\frac{d}{dw_T} \left\{ \frac{1 - a(w_T)}{w_T} \right\} = 0. \tag{6·3}$$


The function $a(w)$ decreases steadily from 1 to 0 as $w$ increases from 0 to $\infty$, and a
typical curve is shown in figure 1. The line $PQ$ joining the points $(0, 1)$ and $(n/S, 0)$
cuts the curve at $R_1$, $R_2$ whose values of $w = w_1, w_2$ satisfy (6·2). Both points
therefore give possible states of equilibrium under load $S$, the ordinate at each
point being the fraction of the total number $n$ of threads which have broken at
equilibrium. It is possible to give a physical interpretation to the region between
$w_1$ and $w_2$. When the load $S$ is applied to the bundle, threads break until a fraction
$a(w_2)$ have given way, at which point equilibrium is established. If now additional
threads are cut, equilibrium is still maintained until the total fraction removed
is $a(w_1)$; beyond this point the cutting of an extra thread causes the whole bundle
to collapse. In fact the region between $w_1$ and $w_2$ gives all possible states of
equilibrium under a load $S$. Perhaps the argument is more clearly seen if it is
noted that the ordinate of the line $PQ$ gives the fraction of threads actually broken
when the load on each thread is $1/w$, while the ordinate of the curve represents
the fraction which must break under this load. Equilibrium is only possible when
the former exceeds the latter.

The condition (6*3) which holds at breaking load is satisfied when PQ is tangent 
to the curve at w T . When PQ is steeper than this, it does not intersect the curve at 
any point and there are no values of w at which equilibrium can be established. 



[Figure 1. A typical curve of $a(w)$ against $w$, with the line $PQ$.]

Small bundles

7. When the bundle is small, the sample of $n$ threads may still be considered as a
finite population in its own right and many of the results proved are directly
applicable, but care has to be exercised in certain cases. For example, in (6·5), $\bar\alpha$ is the
average value of $\alpha$ over those threads which survive at the point of rupture of the
bundle.

In the simplest case when all threads have the same value of $\alpha$, let the strengths
of the $n$ threads be arranged in descending order of magnitude as $\sigma_1, \sigma_2, \ldots, \sigma_n$. Then
the condition for equilibrium with $r$ survivors under a load $S$ is
$$S \le r\sigma_r \tag{7·1}$$
and the breaking load of the bundle is given by
$$S_T = \max(r\sigma_r). \tag{7·2}$$





This rather curious rule may be illustrated by an example. Suppose that six threads
when tested singly snapped at loads given in the first line of table 1. Their descending
order is written out in the second line and finally the product of load × order is given.
Then the rule states that if all six threads had been tested as a bundle, its breaking
load would have been given by the greatest of these products, that is, by 22·0.

Table 1

load σ       3·2     5·2     4·4     8·2     6·1     5·7
order r       6       4       5       1       2       3
σ × r       19·2    20·8    22·0*    8·2    12·2    17·1
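
The rule (7·2) is simple to mechanise. The following Python sketch is an added illustration rather than part of the original text; the function name `bundle_strength` is hypothetical, and the only assumption is the one made in this section, namely that every surviving thread bears an equal share of the load.

```python
def bundle_strength(strengths):
    """Breaking load of a bundle by rule (7.2): the maximum of r * sigma_r,

    where sigma_1 >= sigma_2 >= ... are the thread strengths in descending
    order and r is the number of survivors sharing the load equally.
    """
    ordered = sorted(strengths, reverse=True)
    return max(r * sigma for r, sigma in enumerate(ordered, start=1))
```

Applied to the six loads of table 1, `bundle_strength([3.2, 5.2, 4.4, 8.2, 6.1, 5.7])` reproduces, up to rounding, the starred entry of the table.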

A graphical representation analogous to figure 1 is also available for small bundles,
the curve for $a(w)$ being replaced by the survivor diagram of $w_r$ for the sample,
which descends from 1 to 0 in steps of $1/n$. The example of table 1 is shown
diagrammatically in figure 2.



The formulae for very large bundles were obtained on the hypothesis that they 
all tend to have the same breaking load. It emerges later (§ 23) that while in the 
majority of cases this assumption is true, for certain populations of threads the 
breaking loads of large bundles do not tend to a limiting value with negligible 
dispersion. But even in such a case, the formulae may still be validly applied to 
the bundle considered as a finite population. 





The probability distribution of bundle strength

8. From now on the discussion is confined to threads having a constant value
of $\alpha$. It is assumed that the bundle is randomly sampled from a population of such
threads and we proceed to derive an expression for the chance that a bundle of $n$
threads is of strength less than $S$.

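It is evident on considering the breakage of successive threads that if the bundle
succumbs under load $S$ the conditions to be satisfied are
$$\begin{aligned}
0 &\le \sigma_n < \frac{S}{n},\\
\sigma_n &\le \sigma_{n-1} < \frac{S}{n-1},\\
&\;\;\vdots\\
\sigma_3 &\le \sigma_2 < \frac{S}{2},\\
\sigma_2 &\le \sigma_1 < S,
\end{aligned} \tag{8·1}$$
where $\sigma_n, \ldots, \sigma_2, \sigma_1$ are the strengths of threads breaking in order $1, 2, \ldots, n$,
and the chance of the event (8·1) occurring is
$$B_n = n! \int_0^{S/n} \theta(\sigma_n)\,d\sigma_n \int_{\sigma_n}^{S/(n-1)} \theta(\sigma_{n-1})\,d\sigma_{n-1} \cdots \int_{\sigma_3}^{S/2} \theta(\sigma_2)\,d\sigma_2 \int_{\sigma_2}^{S} \theta(\sigma_1)\,d\sigma_1,$$
the factor $n!$ allowing for all possible ways of arranging the threads. Setting
$$b(\sigma) = \int_0^\sigma \theta(u)\,du, \qquad b_r = b\!\left(\frac{S}{r}\right), \qquad x_r = b(\sigma_{r+1}),$$
the formula is more simply written as
$$B_n = n! \int_0^{b_n} dx_{n-1} \int_{x_{n-1}}^{b_{n-1}} dx_{n-2} \cdots \int_{x_2}^{b_2} dx_1 \int_{x_1}^{b_1} dx_0. \tag{8·2}$$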

The chance (8-2) may be expressed in a variety of ways, some of which are now 
given. 


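9. The slightly more general function
$$B_n(x) = n! \int_x^{b_n} dx_{n-1} \int_{x_{n-1}}^{b_{n-1}} dx_{n-2} \cdots \int_{x_2}^{b_2} dx_1 \int_{x_1}^{b_1} dx_0 \tag{9·1}$$
satisfies the equation
$$\frac{\partial B_n(x)}{\partial x} = -nB_{n-1}(x)$$
with $B_0(x) = 1$ and has the property that $B_n(b_n) = 0$; consequently a Taylor expansion
leads to
$$0 = B_n(x) - n(b_n - x)B_{n-1}(x) + \frac{n(n-1)}{2!}(b_n - x)^2 B_{n-2}(x) - \cdots + (-)^{n-1} n (b_n - x)^{n-1} B_1(x) + (-)^n (b_n - x)^n. \tag{9·2}$$
Solving the set of equations with $n = n, n-1, n-2, \ldots$ and $x = 0$ we obtain
$$B_n = n! \begin{vmatrix}
b_n & \dfrac{b_n^2}{2!} & \dfrac{b_n^3}{3!} & \cdots & \dfrac{b_n^{n-1}}{(n-1)!} & \dfrac{b_n^n}{n!} \\
1 & b_{n-1} & \dfrac{b_{n-1}^2}{2!} & \cdots & \dfrac{b_{n-1}^{n-2}}{(n-2)!} & \dfrac{b_{n-1}^{n-1}}{(n-1)!} \\
0 & 1 & b_{n-2} & \cdots & \dfrac{b_{n-2}^{n-3}}{(n-3)!} & \dfrac{b_{n-2}^{n-2}}{(n-2)!} \\
\vdots & & & & & \vdots \\
0 & 0 & 0 & \cdots & 1 & b_1
\end{vmatrix}.$$
The corresponding formula for $B_n(x)$ is given on writing $b_r - x$ for $b_r$ in this
expression.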
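
For numerical work the recurrence obtained from (9·2) with $x = 0$ is more convenient than the determinant. The Python sketch below is an added illustration, not the author's procedure; it assumes the Taylor expansion in the binomial-coefficient form $\sum_{k=0}^{n} (-)^k \binom{n}{k} b_n^k B_{n-k} = 0$, and the function name `breaking_chances` is hypothetical.

```python
from math import comb

def breaking_chances(b):
    """Return [B_0, B_1, ..., B_n] from the values b[r-1] = b_r = b(S/r).

    Uses the recurrence from (9.2) at x = 0:
        sum_{k=0}^{n} (-1)^k C(n, k) b_n^k B_{n-k} = 0,
    solved for B_n at each step.
    """
    B = [1.0]  # B_0 = 1
    for n in range(1, len(b) + 1):
        bn = b[n - 1]
        tail = sum((-1) ** k * comb(n, k) * bn ** k * B[n - k]
                   for k in range(1, n + 1))
        B.append(-tail)
    return B
```

For two threads this gives $B_2 = 2b_1b_2 - b_2^2$, as is easily confirmed by hand from the recurrence.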

10. There are two useful ways in which $B_n$ can be expanded in series.

(i) The first is most directly arrived at from first principles. Consider the whole
range of strength from 0 to $\infty$ to be divided into intervals bounded by $S/n$, $S/(n-1)$,
$\ldots$, $S/2$, $S$. The chance of a thread having strength between $S/r$ and $S/(r-1)$ is
$b_{r-1} - b_r$. If a bundle of $n$ threads breaks under load $S$, it must at least satisfy the
condition that none of its threads exceeds $S$ in strength, otherwise the last survivor
would not break. The chance that the bundle contains $p_1$ threads between $S/2$ and $S$,
$p_2$ between $S/3$ and $S/2$, $\ldots$, $p_{n-1}$ between $S/n$ and $S/(n-1)$ and $p_n$ less than $S/n$,
where $p_1 + p_2 + \cdots + p_n = n$, is
$$\frac{n!\,(b_1 - b_2)^{p_1} (b_2 - b_3)^{p_2} \cdots (b_{n-1} - b_n)^{p_{n-1}}\, b_n^{p_n}}{p_1!\,p_2! \cdots p_{n-1}!\,p_n!}.$$
But further conditions have to be satisfied if the bundle is to break under load $S$.
For equilibrium never to be possible, the number of threads in the bundle which are
less than $S/r$ in strength must be at least equal to $n - r + 1$, in which case the $p$'s have to
satisfy conditions (10·11) or their equivalent form (10·12), viz.
$$\begin{aligned}
p_n &\ge 1\\
p_n + p_{n-1} &\ge 2\\
p_n + p_{n-1} + p_{n-2} &\ge 3\\
&\;\;\vdots\\
p_n + p_{n-1} + \cdots + p_2 &\ge n - 1\\
p_n + p_{n-1} + \cdots + p_2 + p_1 &= n
\end{aligned} \tag{10·11}$$
$$\begin{aligned}
p_1 &\le 1\\
p_1 + p_2 &\le 2\\
p_1 + p_2 + p_3 &\le 3\\
&\;\;\vdots\\
p_1 + p_2 + \cdots + p_{n-1} &\le n - 1\\
p_1 + p_2 + \cdots + p_n &= n
\end{aligned} \tag{10·12}$$
The chance of the bundle breaking under load $S$ is therefore given by
$$B_n = \sum_p \frac{n!\,(b_1 - b_2)^{p_1} (b_2 - b_3)^{p_2} \cdots (b_{n-1} - b_n)^{p_{n-1}}\, b_n^{p_n}}{p_1!\,p_2! \cdots p_{n-1}!\,p_n!}, \tag{10·2}$$
where the $p$'s are summed over all values consistent with (10·11) or (10·12). The
corresponding formula for $B_n(x)$ is obtained on writing $b_r - x$ for $b_r$.
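
The series (10·2) can be evaluated by brute-force enumeration for small $n$. The Python sketch below is an added illustration under the reading of (10·12) in which every partial sum $p_1 + \cdots + p_k$ is at most $k$; the function name is hypothetical, and the enumeration over all $(n+1)^n$ tuples is practical only for a handful of threads.

```python
from itertools import product
from math import factorial

def breaking_chance_series(b):
    """Evaluate the series (10.2) for B_n, where b[r-1] = b_r = b(S/r)."""
    n = len(b)
    # Chance of a thread falling in each strength interval:
    # q[k-1] = b_k - b_{k+1} for k < n, and q[n-1] = b_n.
    q = [b[k] - b[k + 1] for k in range(n - 1)] + [b[n - 1]]
    total = 0.0
    for p in product(range(n + 1), repeat=n):
        if sum(p) != n:
            continue
        # Conditions (10.12): p_1 + ... + p_k <= k for k = 1, ..., n - 1.
        if any(sum(p[:k]) > k for k in range(1, n)):
            continue
        term = float(factorial(n))
        for qk, pk in zip(q, p):
            term *= qk ** pk / factorial(pk)
        total += term
    return total
```

For $n = 2$ only the pairs $(p_1, p_2) = (0, 2)$ and $(1, 1)$ survive the conditions, giving $b_2^2 + 2(b_1 - b_2)b_2 = 2b_1b_2 - b_2^2$.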








(ii) The second series expansion is the multiple Taylor expansion of the
determinant in powers of the $b_r$'s. We have




where $(\ )_0$ means that all the $b_r$'s are replaced by 0 in the bracketed expression.
Many of the terms of this expansion vanish on account of two rows of the determinant
becoming identical after differentiation when the $b_r$'s are made zero, and in fact the
only non-vanishing terms are obtained as follows. Numbering the rows from the
bottom upwards, first the $r_1$th row is differentiated $r_1$ times, then the $(r_1 + r_2)$th row
$r_2$ times, the $(r_1 + r_2 + r_3)$th row $r_3$ times and so on, till finally the $n$th (top) row is
differentiated $r_m$ times where $r_1 + r_2 + \cdots + r_m = n$, it being essential to differentiate
the top row at least once. If the resulting determinant is then rearranged with the
$r_1$th row at the bottom, the $(r_1 + r_2)$th row in place of the $r_1$th, the $(r_1 + r_2 + r_3)$th in
place of the $(r_1 + r_2)$th and so on, the sign of the determinant is changed by
$$(-)^{r_1 - 1 + r_2 - 1 + \cdots + r_m - 1} = (-)^{n-m}$$
and its value becomes unity. The expansion is therefore
$$B_n = \sum_{m=1}^{n} \sum_{r} (-)^{n-m}\, \frac{n!\; b_{r_1}^{r_1}\, b_{r_1 + r_2}^{r_2} \cdots b_n^{r_m}}{r_1!\,r_2! \cdots r_m!}$$
summed over all values of the $r$'s such that $r_t \ge 1$, all $t$, and $r_1 + r_2 + \cdots + r_m = n$.
Using this expansion, the following formulae are obtained

$$B_0 = 1, \qquad B_1 = b_1, \qquad B_2 = 2b_1b_2 - b_2^2,$$
$$B_3 = 6b_1b_2b_3 - 3b_2^2b_3 - 3b_1b_3^2 + b_3^3,$$
$$B_4 = 24b_1b_2b_3b_4 - 12b_2^2b_3b_4 - 12b_1b_3^2b_4 - 12b_1b_2b_4^2 + 4b_3^3b_4 + 6b_2^2b_4^2 + 4b_1b_4^3 - b_4^4.$$
These expressions are perhaps more simply found by means of the
formula (9·2) with $x = 0$. For higher values of $n$ the formulae rapidly become
unmanageable, and it is necessary to seek an asymptotic expression for $B_n$. Before
turning to this question, however, mention must be made of two forms of $b(s)$
which, though not common ones in practice, give rise to specially simple
expressions for $B_n$ for all values of $n$.

11. Consider first the case where the threads have only two possible strengths $\rho$
and $\nu\rho$, so that
$$b(s) = \begin{cases} 0 & (0 \le s < \rho), \\ b & (\rho \le s < \nu\rho), \\ 1 & (\nu\rho \le s), \end{cases}$$
and let a load $S$ be applied to a bundle of $n$ of such threads.



We observe first that $b_n = 0$ and hence $B_n = 0$ whenever $S < n\rho$. Divide up the
range of $S$ into the intervals $r\nu\rho \le S < (r+1)\nu\rho$. Then for values of $S \ge n\rho$ we have,
when $S$ is in the $r$th interval,
$$b_m = b(S/m) = 1 \quad (m \le r), \qquad b_m = b \quad (m > r).$$
This holds for all intervals having $r > [n/\nu]$ and also for that part of the interval
$r = [n/\nu]$ in which $S \ge n\rho$.

Consider now the series expansion (10·2). When $S \ge n\rho$ and $S$ is in the $r$th interval
the only non-vanishing terms are those in which $p_m = 0$ except for $m = r$ and $m = n$.
Writing $p_r = p$, $p_n = n - p$, conditions (10·12) imply that $p \le r$ and the expression
for $B_n$ becomes, with this special form for $b(s)$:
$$\begin{aligned}
B_n &= 0 \quad \text{when } S < n\rho, \\
B_n &= \sum_{p=0}^{r} \frac{n!\,(1-b)^p\, b^{n-p}}{p!\,(n-p)!} \quad \text{when } r\nu\rho \le S < (r+1)\nu\rho \text{ and } S \ge n\rho.
\end{aligned} \tag{11·2}$$
The strength of the bundle can therefore only have values $r\nu\rho$ ($r > [n/\nu]$) or $n\rho$,
the probability of the value $r\nu\rho$ being the coefficient of $t^r$ in $[b + t(1-b)]^n$ and the
probability of $n\rho$ the sum of the coefficients of $t^p$ for values of $p \le [n/\nu]$. The
approximate form of $B_n$ when $n$ is large is in this case easily deduced from the normal
approximation to the binomial distribution. For very large $n$ there are two distinct
limiting forms according to whether the ‘collapsed’ part contains the greater or
lesser number of binomial terms, the appropriate conditions being $\nu(1-b) < 1$ or $> 1$
respectively. In the first case all values concentrate at $S = n\rho$; in the second they
concentrate at $n\nu\rho(1-b)$. These ultimate limiting forms are immediately obvious
on applying the method of § 6 for very large bundles.
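
The binomial form (11·2) can be compared with a direct enumeration of all $2^n$ arrangements of weak and strong threads, each bundle being broken by the rule (7·2). The Python sketch below is an added illustration with hypothetical function names; it assumes $S$ does not coincide exactly with an attainable bundle strength, so that the distinction between ‘less than’ and ‘not greater than’ does not arise.

```python
from itertools import product
from math import comb

def bundle_strength(strengths):
    """Rule (7.2): maximum of r * sigma_r with strengths in descending order."""
    ordered = sorted(strengths, reverse=True)
    return max(r * s for r, s in enumerate(ordered, start=1))

def chance_below_enumerated(S, n, b, rho, nu):
    """Chance that the bundle strength is less than S, by full enumeration.

    Each thread has strength rho with chance b and nu * rho with chance 1 - b.
    """
    total = 0.0
    for config in product((False, True), repeat=n):  # True marks a strong thread
        k = sum(config)
        prob = (1 - b) ** k * b ** (n - k)
        strengths = [nu * rho if strong else rho for strong in config]
        if bundle_strength(strengths) < S:
            total += prob
    return total

def chance_below_binomial(S, n, b, rho, nu):
    """The same chance from (11.2); S is assumed not to be an attainable strength."""
    if S < n * rho:
        return 0.0
    r = min(int(S // (nu * rho)), n)
    return sum(comb(n, p) * (1 - b) ** p * b ** (n - p) for p in range(r + 1))
```

With, say, $n = 4$, $b = 0.3$, $\rho = 1$ and $\nu = 2.5$ (hypothetical values), the two functions agree at loads such as $S = 3$, $6.2$ and $11$.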

12. The second form of $b(s)$ is that in which the weakness $w = 1/s$ of a thread
has an equal chance of lying anywhere in the interval $0 < w < 1/s_0$ but cannot exceed
$1/s_0$, that is
$$b(s) = \begin{cases} 0 & (0 \le s < s_0), \\ 1 - \dfrac{s_0}{s} & (s_0 \le s). \end{cases} \tag{12·1}$$
We now show that the chance that a bundle of $n$ threads sampled from such a
population has strength less than $S$ is
$$B_n = \begin{cases} 0 & (0 \le S < ns_0), \\ 1 - \dfrac{ns_0}{S} & (ns_0 \le S), \end{cases} \tag{12·2}$$
so that for all values of $n$ the probability distribution of $S/n$ is simply $b(S/n)$.





The proof is as follows. Evidently $b_n = 0$ when $0 \le S < ns_0$ and hence $B_n$ vanishes
for values of $S$ in this interval. Let $S \ge ns_0$ and insert $b_r = 1 - rs_0/S$ in (9·1). The function
$$B_n(x) = (1-x)^n - \frac{ns_0}{S}(1-x)^{n-1}$$
is such that $B_0(x) = 1$, $B_n(1 - ns_0/S) = 0$ and it satisfies
$$\frac{dB_n(x)}{dx} = -nB_{n-1}(x).$$
It must therefore be identical with (9·1) for the particular form of $b_r$ chosen, and
putting $x = 0$ the result follows.

The method already given for large bundles breaks down in this case since
$$s[1 - b(s)] = s_0$$
for $s \ge s_0$ and there is no unique value of $s_T$ which maximizes it. It will be of
interest later to recall (12·2) in connexion with the asymptotic form of $B_n$ when
$b(s)$ behaves like $1 - s_0/s$ for large $s$ (§ 23).
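
The result (12·2) lends itself to a simple sampling check. The Python sketch below is an added illustration, not part of the proof; it draws each weakness uniformly from $(0, 1/s_0]$, breaks the bundle by the rule (7·2), and estimates the chance that the strength falls below $S$. The function names, the seed and the sample size are hypothetical choices.

```python
import random

def bundle_strength(strengths):
    """Rule (7.2): maximum of r * sigma_r with strengths in descending order."""
    ordered = sorted(strengths, reverse=True)
    return max(r * s for r, s in enumerate(ordered, start=1))

def empirical_chance_below(S, n, s0, trials, rng):
    """Estimate the chance that a bundle of n threads has strength less than S.

    Each weakness w is uniform on (0, 1/s0], so the strength is s = 1/w >= s0.
    """
    hits = 0
    for _ in range(trials):
        # 1 - rng.random() lies in (0, 1], which avoids division by zero.
        strengths = [s0 / (1.0 - rng.random()) for _ in range(n)]
        if bundle_strength(strengths) < S:
            hits += 1
    return hits / trials
```

For $n = 5$, $s_0 = 1$ and $S = 10$ the formula (12·2) gives $1 - 5/10 = 1/2$, and an estimate from some twenty thousand sampled bundles should lie close to this value.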


The asymptotic behaviour of $B_n$ for large $n$

13. It has been shown that as $n$ becomes very large the bundle strength is
likely to concentrate near a value $S_T = n \max s[1 - b(s)]$. No indication has so far
been given, however, except in two special cases, of the form assumed by $B_n$ in the
vicinity of $S_T$, and the remainder of the paper is devoted to a study of the asymptotic
behaviour of $B_n$ for general forms of the parent distribution $b(s)$. The mode of attack
adopted here provides the dominant term of the asymptotic expansion together
with the order of magnitude of the approximation involved.

It is first necessary to derive an important identity upon which the subsequent
theory depends. Let us introduce the notation


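$$(n, m) = \int_0^{b_n} dx_{n-1} \int_{x_{n-1}}^{b_{n-1}} dx_{n-2} \cdots \int_{x_{m+1}}^{b_{m+1}} dx_m
= \begin{vmatrix}
b_n & \dfrac{b_n^2}{2!} & \cdots & \dfrac{b_n^{n-m-1}}{(n-m-1)!} & \dfrac{b_n^{n-m}}{(n-m)!} \\
1 & b_{n-1} & \cdots & \dfrac{b_{n-1}^{n-m-2}}{(n-m-2)!} & \dfrac{b_{n-1}^{n-m-1}}{(n-m-1)!} \\
\vdots & & & & \vdots \\
0 & 0 & \cdots & 1 & b_{m+1}
\end{vmatrix} \tag{13·1}$$
and $(n, n) = 1$, so that, for example, $B_n = n!\,(n, 0)$. Now in the determinant $(n, 0)$
multiply the $(n-1)$th column by $-x$, the $(n-2)$th column by $x^2/2!$, $\ldots$, the 1st column
by $(-)^{n-1} x^{n-1}/(n-1)!$ and add them to the last column, thus obtaining
$$(n, 0) = \begin{vmatrix}
b_n & \dfrac{b_n^2}{2!} & \dfrac{b_n^3}{3!} & \cdots & \dfrac{b_n^{n-1}}{(n-1)!} & \dfrac{(b_n - x)^n}{n!} - \dfrac{(-x)^n}{n!} \\
1 & b_{n-1} & \dfrac{b_{n-1}^2}{2!} & \cdots & \dfrac{b_{n-1}^{n-2}}{(n-2)!} & \dfrac{(b_{n-1} - x)^{n-1}}{(n-1)!} \\
0 & 1 & b_{n-2} & \cdots & \dfrac{b_{n-2}^{n-3}}{(n-3)!} & \dfrac{(b_{n-2} - x)^{n-2}}{(n-2)!} \\
\vdots & & & & & \vdots \\
0 & 0 & 0 & \cdots & 1 & b_1 - x
\end{vmatrix}.$$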

The expansion of the determinant in terms of the last column may then be
rearranged to give
$$\frac{x^n}{n!} = \sum_{m=0}^{n} (n, m)\, \frac{(x - b_m)^m}{m!} \quad \text{for all } x, \tag{13·21}$$
or, more generally, after differentiating $r$ times,
$$\frac{x^{n-r}}{(n-r)!} = \sum_{m=r}^{n} (n, m)\, \frac{(x - b_m)^{m-r}}{(m-r)!} \quad \text{for all } x \text{ and } r \ge 0. \tag{13·22}$$
One observes in passing that $(n, m)$ is related to the chance $B_{n,m}$ that $m$ threads
will survive under a load $S$ by the formula
$$B_{n,m} = \frac{n!}{m!}\,(n, m)\,(1 - b_m)^m.$$
Substituting in (13·21) leads to
$$x^n = \sum_{m=0}^{n} B_{n,m} \left( \frac{x - b_m}{1 - b_m} \right)^m.$$
For example, when $x = 1$ the result $1 = \sum_{m=0}^{n} B_{n,m}$ simply expresses the fact that
values of $m$ from 0 to $n$ exhaust all possible contingencies.

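The identities (13·21) and (13·22) are not in the form most convenient for our
purpose. Writing $x = 1/\lambda$, (13·21) becomes
$$1 = \sum_{m=0}^{n} \frac{n!\,\lambda^{n-m}\,(n, m)\,(1 - \lambda b_m)^m}{m!},$$
and by analogy with the binomial distribution, one is led to consider the expression
$$Q_{n,m} = \frac{(n-m)!}{b_m^{n-m}} \int_0^{b_n} dx_{n-1} \int_{x_{n-1}}^{b_{n-1}} dx_{n-2} \cdots \int_{x_{m+1}}^{b_{m+1}} dx_m \tag{13·3}$$
which satisfies, and is in fact uniquely determined by,
$$1 = \sum_{m=0}^{n} Q_{n,m}\, \frac{n!\,(\lambda b_m)^{n-m} (1 - \lambda b_m)^m}{m!\,(n-m)!}, \quad \text{all } \lambda, \tag{13·4}$$
or the more general identity
$$1 = \sum_{m=r}^{n} Q_{n,m}\, \frac{(n-r)!\,(\kappa b_m)^{n-m} (1 - \kappa b_m)^{m-r}}{(m-r)!\,(n-m)!}, \quad \text{all } \kappa, \text{ all } r \ge 0, \tag{13·5}$$
where $\kappa$ is written in place of $\lambda$ for a reason that will be apparent later. Note
that $Q_{n,0}$ is simply $B_n$.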

The function $T_{n,m,r}$

14. When $n$ is large, (13·4) and (13·5) approximate to integral equations and the
behaviour of $Q_{n,m}$ for large $n$ is evidently related to the asymptotic behaviour of
$$T_{n,m,r} = \frac{(n-r)!\,(\kappa b_m)^{n-m} (1 - \kappa b_m)^{m-r}}{(m-r)!\,(n-m)!}.$$
It will be found simplest first to discuss the special case
$$T_{n,m} = \frac{n!\,(\lambda b_m)^{n-m} (1 - \lambda b_m)^m}{m!\,(n-m)!},$$
and to extend the results afterwards to the more general function. The important
range of $\lambda$ is $0 \le \lambda \le 1$, in which $T_{n,m}$ is never negative. In the case of the ordinary
binomial distribution $\dfrac{n!}{m!\,(n-m)!}\,p^m (1-p)^{n-m}$ there are two distinct limiting forms for large $n$.
When neither $p$ nor $1 - p$ is small the binomial is approximated to by a continuous
normal distribution having the same first and second moments $np$ and $np(1-p)$
respectively. On the other hand, when $p$ is small the appropriate limiting form is the
discrete Poisson distribution with parameter $np$, and similarly for small $1 - p$. In
the present more general case it is useful to preserve the distinction, and we first
discuss various continuous limiting forms corresponding to the normal approximation.

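Continuous limiting forms of $T_{n,m}$

15. Applying Stirling's formula to the factorials,
$$T_{n,m} = \frac{n^{n+\frac12} (\lambda b_m)^{n-m} (1 - \lambda b_m)^m}{\sqrt{2\pi}\; m^{m+\frac12} (n-m)^{n-m+\frac12}} \left[ 1 + O\!\left(\frac{1}{m}\right) + O\!\left(\frac{1}{n-m}\right) \right].$$
If $z = m/n$, $\zeta = S/n$ and $a(w)$ is as defined in § 7, so that $b_m = a(z/\zeta)$, the
approximation is
$$T_n(z)\,dz = \sqrt{\frac{n}{2\pi z(1-z)}} \left[ \frac{\lambda a(z/\zeta)}{1-z} \right]^{n(1-z)} \left[ \frac{1 - \lambda a(z/\zeta)}{z} \right]^{nz} dz\, [1 + O(n^{-1})]. \tag{15·1}$$
Clearly the important regions for $T_n(z)$ are near the maxima of
$$F(z) = \left[ \frac{\lambda a}{1-z} \right]^{1-z} \left[ \frac{1 - \lambda a}{z} \right]^{z}
= \left[ 1 - \frac{1 - z - \lambda a}{1-z} \right]^{1-z} \left[ 1 + \frac{1 - z - \lambda a}{z} \right]^{z}. \tag{15·2}$$
Differentiating,
$$\frac{F'}{F} = \frac{1 - z - \lambda a}{z(1-z)} \left( 1 + \frac{\lambda}{\zeta} a' \right) + \text{higher powers of } (1 - z - \lambda a),$$
$$\frac{F''}{F} - \left( \frac{F'}{F} \right)^2 = -\frac{\left( 1 + \frac{\lambda}{\zeta} a' \right)^2}{z(1-z)} + \text{terms containing } (1 - z - \lambda a).$$
The condition $F'(z) = 0$ is thus at least satisfied when
$$1 - z - \lambda a(z/\zeta) = 0 \tag{15·3}$$
and at the roots $z = z_r$ of this equation it is found that
$$F(z_r) = 1, \qquad F''(z_r) = -\frac{\left[ 1 + \frac{\lambda}{\zeta} a'(z_r/\zeta) \right]^2}{z_r(1 - z_r)}. \tag{15·4}$$
It follows that the roots $z_r$ are all maxima of $F(z)$.

Moreover, at all other points in $0 < z < 1$ one must necessarily have $F(z) < 1$. For
consider $F = e^G$ as a function of $\lambda$. Turning points occur when $G_\lambda = 0$, where
$$G_\lambda = a \left( \frac{1-z}{\lambda a} - \frac{z}{1 - \lambda a} \right) = \frac{1 - z - \lambda a}{\lambda(1 - \lambda a)} \tag{15·5}$$
and for given $z$ there is only one value of $\lambda$ satisfying $G_\lambda = 0$, namely, that making
$1 - z - \lambda a = 0$. Furthermore
$$G_{\lambda\lambda} = -\left\{ \frac{1-z}{\lambda^2} + \frac{z a^2}{(1 - \lambda a)^2} \right\},$$
showing that $G_{\lambda\lambda} < 0$ for all $z$ and $\lambda$ lying between 0 and 1. Hence $G$ has a unique
maximum at the point $\lambda = (1-z)/a$ and so has $F$ for the same $\lambda$, taking the value
$F = 1$ there, and decreasing steadily on either side of it. No values of $z$ and $\lambda$ between
0 and 1 can therefore be found at which $F \ge 1$ and $1 - z - \lambda a \ne 0$, for if such a value
existed, then at $\lambda = (1-z)/a$ one should also have $F = 1$, $1 - z - \lambda a = 0$ which would
imply $G_{\lambda\lambda} \ge 0$ at some point of the range.

Considering $F$ once more as a function of $z$ for fixed $\lambda$, we conclude that when $n$
is large, $F^n$ is negligible except near the roots $z = z_r$ of (15·3) at which it approximates
to 1, and consequently $T_n(z)$ consists of a series of isolated peaks situated at these
roots.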
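
The behaviour of $F(z)$ is readily examined numerically. The Python sketch below is an added illustration under stated assumptions: it takes $F(z)$ as the product of $[\lambda a/(1-z)]^{1-z}$ and $[(1-\lambda a)/z]^{z}$ with $a = a(z/\zeta)$, locates a root of $1 - z - \lambda a(z/\zeta) = 0$ by bisection, and uses the hypothetical weakness curve $a(w) = e^{-w}$ with hypothetical values of $\lambda$ and $\zeta$ in the usage note.

```python
import math

def F(z, lam, zeta, a):
    """F(z) = [lam*a/(1-z)]**(1-z) * [(1-lam*a)/z]**z with a = a(z/zeta), 0 < z < 1."""
    la = lam * a(z / zeta)
    return (la / (1.0 - z)) ** (1.0 - z) * ((1.0 - la) / z) ** z

def root_by_bisection(lam, zeta, a, lo, hi, iterations=200):
    """Root of f(z) = 1 - z - lam*a(z/zeta); f(lo) and f(hi) must differ in sign."""
    def f(z):
        return 1.0 - z - lam * a(z / zeta)
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if f(lo) * f(mid) <= 0.0:
            hi = mid
        else:
            lo = mid
    return 0.5 * (lo + hi)

def example_curve(w):
    """Hypothetical a(w), decreasing steadily from 1 to 0 as w increases."""
    return math.exp(-w)
```

With $\lambda = 0.8$ and $\zeta = 0.5$ the single root in $0 < z < 1$ lies near $z = 0.855$; there $F$ equals 1 to rounding error, and on a grid of other values of $z$ it stays below 1.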

The positions of the roots are conveniently examined by means of a diagram
(figure 3) similar to that introduced in § 7. A typical curve for $a(w)$ is shown. The
line joining the point $(0, 1/\lambda)$ to the point $(1/\zeta, 0)$ intersects the curve at points
$z_1/\zeta$, $z_2/\zeta$, $z_3/\zeta$, and it is evident that $z_1, z_2, z_3$ so found are the only positive roots of
$1 - z - \lambda a(z/\zeta) = 0$. With an $a(w)$ curve of the form shown a maximum of 3 positive
real roots is possible with suitable values of $\zeta$ and $\lambda$, and in general the maximum
number of roots is odd.



16. Consider first the behaviour of $T_n(z)$ near a simple real root $z_r$ such that neither
$z_r$ nor $1 - z_r$ is small. On substituting
$$1 - z - \lambda a(z/\zeta) = -\left[ 1 + \frac{\lambda}{\zeta} a'(z_r/\zeta) \right] (z - z_r) + O(z - z_r)^2$$
in (15·2) the appropriate expansion of (15·1) is found to be
$$T_n(z) = \sqrt{\frac{n}{2\pi z_r(1 - z_r)}}\, \exp\left\{ -\frac{n \left[ 1 + \frac{\lambda}{\zeta} a'(z_r/\zeta) \right]^2 (z - z_r)^2}{2 z_r (1 - z_r)} \right\} \{ 1 + O(n(z - z_r)^3) + O(z - z_r) \}.$$
Since the exponential is negligibly small except when $n(z - z_r)^2 = O(1)$, only
the range $z - z_r = O(n^{-1/2})$ need be considered and the approximation factor is
$1 + O(n^{-1/2})$. It will be found more useful, however, to write the result in the form
$$T_n(z) = \sqrt{\frac{n}{2\pi z_r(1 - z_r)}}\, \exp\left\{ -\frac{n \left[ 1 + \frac{\lambda}{\zeta} a'(z_r/\zeta) \right]^2 (z - z_r)^2}{2 z_r (1 - z_r)} \right\} \times \{ 1 + A(z - z_r) + Bn(z - z_r)^3 + O(n^{-1}) \}, \tag{16·1}$$
where the constants $A$ and $B$ are $O(1)$.


17. Suppose next that the two roots $z_2$ and $z_3$, which may be real or complex, are
such that $|z_2 - z_3|$ is small. It is assumed that neither $|z_r|$ nor $|1 - z_r|$ is small
($r = 2, 3$).

Write
$$\epsilon = \tfrac12 (z_3 - z_2)$$
and
$$f(z) = 1 - z - \lambda a(z/\zeta).$$
Let $\bar z$ be that value of $z$ near $z_2$, $z_3$ which makes $f'(z) = 0$; then since
$$f(z) = f(\bar z) + \tfrac12 (z - \bar z)^2 f''(\bar z) + O(z - \bar z)^3$$
and
$$f(z_2) = f(z_3) = 0, \qquad f''(z) = -\frac{\lambda}{\zeta^2}\, a''(z/\zeta),$$
it follows, if $f''(\bar z)$ is not small and $z - \bar z = O(\epsilon)$, that
$$f(z) = -\frac{\lambda a''(\bar z/\zeta)}{2\zeta^2} \left[ (z - \bar z)^2 - \epsilon^2 \right] + O(\epsilon^3) \tag{17·1}$$
and
$$\bar z = \tfrac12 (z_2 + z_3) + O(\epsilon^2).$$
The sign of $\epsilon^2$ may be either positive or negative according as the roots are real or
complex.

If the parameters $\lambda$ and $\zeta$ are altered until $z_2 = z_3$, we shall call the coincident
value of the roots a critical point of $z$. In particular, two critical points $z_T$, $z_c$ are
obtained by (i) keeping $\lambda$ fixed and changing $\zeta$ or (ii) keeping $\zeta$ fixed and changing
$\lambda$, respectively, the corresponding critical values of $\zeta$ and $\lambda$ being $\zeta_T$ and $\lambda_c$. It will
be shown that both $z_T$ and $z_c$ differ from $\bar z$ by $O(\epsilon^2)$. They are illustrated graphically
in figure 4.

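Expressions for $T_n(z)$ are now obtained in terms of $z_\tau$ and $z_c$ respectively.

(i) Put $w = z/\zeta$, $w_\tau = z_\tau/\zeta_\tau$, and let

$$\zeta(w) = \frac{1 - \lambda a(w)}{w} = \zeta_\tau + (w - w_\tau)\zeta'_\tau + \tfrac{1}{2}(w - w_\tau)^2\zeta''_\tau + O(w - w_\tau)^3,$$
where the suffix $\tau$ denotes evaluation at $w_\tau$. By (15·3),

$$\zeta(w_2) = \zeta(w_3) = \zeta, \qquad \zeta'_\tau = 0.$$

In general $\zeta''_\tau$ is not small, so that

$$w_\tau = \tfrac{1}{2}(w_2 + w_3) + O(\epsilon^2), \qquad \epsilon^2 = \tfrac{1}{4}\zeta_\tau^2(w_2 - w_3)^2 + O(\epsilon^3)$$

or, to the same order,

$$z_\tau = \bar{z} + O(\epsilon^2), \qquad \epsilon^2 = -\frac{2\zeta_\tau z_\tau}{\lambda a''_\tau}(\zeta - \zeta_\tau) + O(\epsilon^3),$$

and inserting these results in (16·1) and (15·2) the form of $F(z)$ in the range
$z - z_\tau = O(\epsilon)$ is found to be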






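Now $\epsilon$ has to be at most $O(n^{-1/4})$, for otherwise $T_n(z)$ in (15·1) becomes small at $z = z_\tau$
and $z_2$, $z_3$ may then be considered as isolated simple roots. Also, in the vicinity of $z_\tau$,
$T_n(z)$ is negligible outside a range $z - z_\tau = O(n^{-1/4})$ and its approximate form is
therefore

$$T_n(z) = \sqrt{\frac{n}{2\pi z_\tau(1 - z_\tau)}}\,\exp\left\{-\frac{n}{2z_\tau(1 - z_\tau)}\left[\frac{\lambda a''_\tau}{2\zeta_\tau^2}(z - z_\tau)^2 + \frac{z_\tau}{\zeta_\tau}(\zeta - \zeta_\tau)\right]^2\right\}\{1 + O(n^{-1/4})\} \tag{17·2}$$

provided $z_\tau = O(1)$, $1 - z_\tau = O(1)$.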



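(ii) Put

$$\lambda(z) = \frac{1 - z}{a(z/\zeta)} = \lambda_c + (z - z_c)\lambda'_c + \tfrac{1}{2}(z - z_c)^2\lambda''_c + O(z - z_c)^3.$$

By definition $\lambda(z_2) = \lambda(z_3) = \lambda$, and $\lambda'_c = 0$, $\lambda''_c = -\lambda_c a''_c/\zeta^2 a_c$, the suffix $c$ denoting
evaluation at $z_c$. Since $\lambda''_c$ is not usually small, it follows that

$$z_c = \bar{z} + O(\epsilon^2), \qquad \epsilon^2 = -\frac{2\zeta^2 a_c}{\lambda_c a''_c}(\lambda - \lambda_c) + O(\epsilon^3)$$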

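and, as before, the approximate form of $T_n(z)$ is

$$T_n(z) = \sqrt{\frac{n}{2\pi z_c(1 - z_c)}}\,\exp\left\{-\frac{n}{2z_c(1 - z_c)}\left[\frac{\lambda_c a''_c}{2\zeta^2}(z - z_c)^2 + a_c(\lambda - \lambda_c)\right]^2\right\}\{1 + O(n^{-1/4})\}. \tag{17·3}$$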



There is one other continuous limiting form of $T_n(z)$ which has to be considered.
When the first root $z_1$ is isolated and small the approximation to $T_n(z)$ in its vicinity
is no longer given by (16·1), the discrete limiting form discussed below (§20) taking
its place. But when $z_1$ and $z_2$ are close together and both small, a continuous
approximation to $T_n(z)$ is still appropriate. It is best to discuss this case later (§19) in its
more general form, after the results so far obtained have been extended to $T_{n,m,r}$.


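Continuous limiting forms of $T_{n,m,r}$
18. The asymptotic behaviour of the function

$$T_{n,m,r} = \frac{(n-r)!\,(\kappa b_m)^{n-m}(1 - \kappa b_m)^{m-r}}{(m-r)!\,(n-m)!}$$

is discussed on similar lines and the proofs will be only briefly sketched. For $T_{n,m,r}$
to be never negative in the range $r \le m \le n$, $\kappa$ has to satisfy the condition

$$0 \le \kappa \le 1/b_r. \tag{18·1}$$

The continuous approximations hold when neither $(m - r)/n$ nor $1 - m/n$ is small,
and Stirling's formula then gives

$$T_{n,m,r} = \frac{(n-r)^{n-r+\frac{1}{2}}(\kappa b_m)^{n-m}(1 - \kappa b_m)^{m-r}}{\sqrt{2\pi}\,(m-r)^{m-r+\frac{1}{2}}(n-m)^{n-m+\frac{1}{2}}}\,[1 + O(n^{-1})].$$

Adopting the previous notation and putting in addition

$$r = nx, \qquad T_{n,m,r} = T_n(z, x)\,dz$$

this becomes

$$T_n(z, x) = \sqrt{\frac{n(1-x)}{2\pi(z-x)(1-z)}}\,F^n(z, x)\,[1 + O(n^{-1})], \tag{18·2}$$

where $x < z < 1$, and we may write

$$F(z, x) = \left(\frac{\lambda a(z/\zeta)}{1-z}\right)^{1-z}\left(\frac{1 - x - \lambda a(z/\zeta)}{z-x}\right)^{z-x}, \tag{18·3}$$

where

$$\lambda = (1 - x)\kappa. \tag{18·4}$$

By (18·1), $\lambda$ must satisfy the condition

$$0 \le \lambda \le \lambda_x = (1 - x)/a(x/\zeta). \tag{18·5}$$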




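$F(z, x)$ then takes its maximum value $F = 1$ at the roots $z_r$ of $1 - z - \lambda a(z/\zeta) = 0$,
and the argument proceeds as before. Near an isolated simple root $z_r$ such that
$x < z_r < 1$, $z_r - x = O(1)$, $1 - z_r = O(1)$ the limiting form of $T_n(z, x)$ is

$$T_n(z, x) = \sqrt{\frac{n(1-x)}{2\pi(z_r - x)(1 - z_r)}}\,\exp\left\{-\frac{n(1-x)\left[1 + \frac{\lambda}{\zeta}a'(z_r/\zeta)\right]^2(z - z_r)^2}{2(z_r - x)(1 - z_r)}\right\}[1 + A(z - z_r) + Bn(z - z_r)^3 + O(n^{-1})] \tag{18·6}$$

and near a critical point the formulae corresponding to (17·2) and (17·3) are

$$T_n(z, x) = \sqrt{\frac{n(1-x)}{2\pi(z_\tau - x)(1 - z_\tau)}}\,\exp\left\{-\frac{n(1-x)}{2(z_\tau - x)(1 - z_\tau)}\left[\frac{\lambda a''_\tau}{2\zeta_\tau^2}(z - z_\tau)^2 + \frac{z_\tau}{\zeta_\tau}(\zeta - \zeta_\tau)\right]^2\right\}[1 + O(n^{-1/4})], \tag{18·7}$$

$$T_n(z, x) = \sqrt{\frac{n(1-x)}{2\pi(z_c - x)(1 - z_c)}}\,\exp\left\{-\frac{n(1-x)}{2(z_c - x)(1 - z_c)}\left[\frac{\lambda_c a''_c}{2\zeta^2}(z - z_c)^2 + a_c(\lambda - \lambda_c)\right]^2\right\}[1 + O(n^{-1/4})]. \tag{18·8}$$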


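19. Suppose now that $x$ is in the neighbourhood of the critical point. Let
$z_c - x = O(\epsilon)$; then, on substituting (17·1) in (18·3), $F(z, x)$ assumes the form

$$F(z, x) = \exp\left\{-\frac{1}{2(z - x)}\left[\frac{\lambda_c a''_c}{2\zeta^2}(z - z_c)^2 + a_c(\lambda - \lambda_c)\right]^2\right\}[1 + O(\epsilon^4)]$$

since $1 - z = 1 - x + O(\epsilon)$, and (18·2) becomes

$$T_n(z, x) = \sqrt{\frac{n}{2\pi(z - x)}}\,\exp\left\{-\frac{n}{2(z - x)}\left[\frac{\lambda_c a''_c}{2\zeta^2}(z - z_c)^2 + a_c(\lambda - \lambda_c)\right]^2\right\}[1 + O(\epsilon) + O(n\epsilon^4)]. \tag{19·1}$$

For the exponent to be $O(1)$ at $z = z_c$ we must have $\epsilon = O(n^{-1/3})$ and so, over a
range $z - z_c = O(n^{-1/3})$, (19·1) holds to $O(n^{-1/3})$.

In the discussion of the limiting form of $Q_{n,m}$, however, it will be necessary to
assume $\epsilon = O(n^{-1/4})$ in this case. At $z = z_c$ the exponent is then $O(n^{1/4})$ and $T_n(z, x)$ splits
into two isolated peaks at $z_2$, $z_3$. The behaviour of $T_n(z, x)$ near $z_3$ is of interest, and
one would expect the approximation (18·6) to hold there, but to a lower order of
accuracy. To study $T_n(z, x)$ near $z_3$, write (17·1) as

$$f(z) = -\frac{\lambda_c a''_c}{2\zeta^2}(z - z_2)(z - z_3) + O(\epsilon^3).$$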




$T_n(z, x)$ is not negligible in a range of $z - z_3$ satisfying

$$\frac{n\lambda_c^2 a''^2_c (z - z_2)^2(z - z_3)^2}{8\zeta^4(z - x)} = O(1)$$

and since $z - z_2$ and $z - x$ are $O(n^{-1/4})$, this implies

$$z - z_3 = O(n^{-3/8}).$$

The approximation near $z_3$ when $\epsilon = O(n^{-1/4})$ is then found to be

$$T_n(z, x) = \sqrt{\frac{n}{2\pi(z_3 - x)}}\,\exp\left[-\frac{n\lambda_c^2 a''^2_c (z_3 - z_c)^2(z - z_3)^2}{2\zeta^4(z_3 - x)}\right][1 + An^{1/4}(z - z_3) + Bn(z - z_3)^3 + O(n^{-1/4})], \tag{19·2}$$

where $A$, $B$ are $O(1)$ and independent of $z$. The formula agrees to $O(n^{-1/8})$ with (18·6).


Discrete limiting forms of $T_{n,m}$
20. Reverting once more to

$$T_{n,m} = \frac{n!}{m!\,(n-m)!}(\lambda b_m)^{n-m}(1 - \lambda b_m)^m,$$

the analogue of the Poisson binomial limit, when $n$ is large but $m$ is $O(1)$, is next
discussed. It is assumed that $a(z/\zeta)$ admits of an expansion in integral powers of $z$
near $z = 0$. The possibility of an expansion in non-integral powers is realized, and,
in fact, such an expansion is to be expected in some cases in view of the limiting
form (5·7). But the more general analysis is not essentially different, and for simplicity
only integral powers are considered. The expansion of $a(z/\zeta)$ is then

$$a(z/\zeta) = 1 + a'(0)\frac{z}{\zeta} + O(z^2),$$

i.e.

$$b_m = 1 - \frac{m\zeta_0}{n\zeta} + O(n^{-2}) \tag{20·1}$$

where $1/\zeta_0$ is the intercept on the horizontal axis of the tangent to the $a(w)$ curve
at the origin. Two types of $a(w)$ curve are shown in figures 5 and 6. Since the
important values of $\lambda$ turn out to be near $\lambda = 1$, we put

$$\lambda = 1 - \mu/n$$

where $\mu$ is $O(1)$. Then

$$(n - m)\log \lambda b_m = -(\mu + m\zeta_0/\zeta) + O(n^{-1}),$$

$$(1 - \lambda b_m) = \frac{1}{n}(\mu + m\zeta_0/\zeta) + O(n^{-2}),$$

and

$$T_{n,m} = \frac{n^{n+\frac{1}{2}}\,e^{-m}}{m!\,(n-m)^{n-m+\frac{1}{2}}\,n^m}(\mu + m\zeta_0/\zeta)^m e^{-(\mu + m\zeta_0/\zeta)}[1 + O(n^{-1})].$$

Hence

$$T_{n,m} = \frac{e^{-m}}{(1 - m/n)^{n-m+\frac{1}{2}}}\,\frac{(\mu + m\zeta_0/\zeta)^m}{m!}e^{-(\mu + m\zeta_0/\zeta)}[1 + O(n^{-1})]$$

and since $(1 - m/n)^{n-m+\frac{1}{2}} = e^{-m}[1 + O(n^{-1})]$,

the appropriate limiting form of $T_{n,m}$ is

$$T_{n,m} = \frac{(\mu + m\zeta_0/\zeta)^m}{m!}e^{-(\mu + m\zeta_0/\zeta)}[1 + O(n^{-1})]. \tag{20·2}$$


The majority of distributions encountered in practice are such that $\zeta_0 = 0$, for
this condition implies that the chance that a thread exceeds $s$ in strength tends to
zero more rapidly than $1/s$ as $s$ increases. In that case (20·2) degenerates to the
Poisson formula

$$T_{n,m} = \frac{\mu^m e^{-\mu}}{m!}[1 + O(n^{-1})]. \tag{20·3}$$

It is a useful confirmation of a later result to prove that when $\zeta > \zeta_0$ and $\zeta - \zeta_0$ is
not small

$$\sum_{m=0}^{\infty}\frac{(\mu + m\zeta_0/\zeta)^m}{m!}e^{-(\mu + m\zeta_0/\zeta)} = \frac{1}{1 - \zeta_0/\zeta}, \tag{20·4}$$

which is independent of $\mu$.

The series converges since the ratio of the $(m + 1)$th to the $m$th term tends to
$(\zeta_0/\zeta)\exp(1 - \zeta_0/\zeta)$, which is less than 1 except when $\zeta = \zeta_0$. It may be written in the
form

$$\sum_{m=0}^{\infty}\frac{1}{2\pi i}\oint_C \frac{(-1)^m e^{-(\mu + m\zeta_0/\zeta)u}}{(u - 1)^{m+1}}\,du$$

where the contour $C$ encloses $u = 1$ and is such that $|\exp(-u\zeta_0/\zeta)| < |1 - u|$ at every
point on it. The curve $|1 - u| = \nu\,|\exp(-u\zeta_0/\zeta)|$ satisfies this condition and has as
one of its branches a closed loop containing $u = 1$, and also $u = 0$, provided that

$$1 < \nu < (\zeta/\zeta_0)\exp(\zeta_0/\zeta - 1)$$

so $C$ is chosen to be this loop. The series therefore has the sum

$$\frac{1}{2\pi i}\oint_C \frac{e^{-\mu u}\,du}{u - 1 + e^{-u\zeta_0/\zeta}}.$$

The integrand has a simple pole inside the contour at $u = 0$, at which the residue is $1/(1 - \zeta_0/\zeta)$,
and some consideration will show that there are no other singularities within $C$.
Consequently the sum of the series is $1/(1 - \zeta_0/\zeta)$ when $\zeta > \zeta_0$, which is the required
result.
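The identity (20·4), that $\sum_{m \ge 0} (\mu + m\theta)^m e^{-(\mu + m\theta)}/m! = 1/(1 - \theta)$ with $\theta = \zeta_0/\zeta < 1$, is easy to confirm numerically. The sketch below is illustrative; the function name and the truncation at 2000 terms are choices made here, and each term is evaluated through logarithms to avoid overflow.

```python
import math


def series_sum(mu, theta, terms=2000):
    """Partial sum of (mu + m*theta)^m * exp(-(mu + m*theta)) / m!."""
    total = 0.0
    for m in range(terms):
        x = mu + m * theta
        if x == 0.0:
            # 0^0 is taken as 1 for m = 0; the term vanishes for m > 0.
            total += 1.0 if m == 0 else 0.0
            continue
        total += math.exp(m * math.log(x) - x - math.lgamma(m + 1))
    return total


if __name__ == "__main__":
    theta = 0.5  # plays the part of zeta_0 / zeta
    for mu in (0.0, 0.3, 1.0, 5.0):
        print(mu, series_sum(mu, theta), 1.0 / (1.0 - theta))
```

The printed sums should not depend on `mu`, and with `theta = 0` the series reduces to the ordinary Poisson sum.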


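Discrete limiting form of $T_{n,m,r}$
21. The corresponding discrete limiting form of

$$T_{n,m,r} = \frac{(n-r)!}{(m-r)!\,(n-m)!}(\kappa b_m)^{n-m}(1 - \kappa b_m)^{m-r} \qquad (0 < \kappa < 1/b_r),$$

is appropriate when $n$, $m$ and $r$ are all large, but $m - r$ is $O(1)$. Retaining the notation
of the previous sections, we expand $a(z/\zeta)$ near $x$ as

$$a(z/\zeta) = a(x/\zeta) + \frac{z - x}{\zeta}a'(x/\zeta) + O(z - x)^2$$

i.e.

$$b_m = b_r\left[1 - \frac{(m - r)\zeta_x}{n\zeta}\right] + O(n^{-2}) \tag{21·1}$$

where $\zeta_x = -a'(x/\zeta)/a(x/\zeta)$, and $1/\zeta_x$ is the subtangent to the $a(w)$ curve at $w = x/\zeta$. The
important values of $\kappa$ are near its maximum $1/b_r$ and accordingly we write

$$\kappa = \frac{1}{b_r}\left(1 - \frac{\mu}{n}\right)$$

or in terms of $\lambda = \kappa(1 - x)$,

$$\lambda = \lambda_x\left(1 - \frac{\mu}{n}\right). \tag{21·2}$$

Then

$$(n - m)\log \kappa b_m = -\frac{n - m}{n}\left[\mu + (m - r)\zeta_x/\zeta\right] + O(n^{-1}) = -(1 - x)\left[\mu + (m - r)\zeta_x/\zeta\right] + O(n^{-1}),$$

$$1 - \kappa b_m = \frac{1}{n}\left[\mu + (m - r)\zeta_x/\zeta\right] + O(n^{-2})$$

and $T_{n,m,r}$ reduces to

$$T_{n,m,r} = \frac{\{(1 - x)[\mu + (m - r)\zeta_x/\zeta]\}^{m-r}}{(m - r)!}\,e^{-(1-x)[\mu + (m-r)\zeta_x/\zeta]}\,[1 + O(n^{-1})]. \tag{21·3}$$

It follows immediately from (20·4) that, to $O(n^{-1})$,

$$\sum_{m=r}^{\infty} T_{n,m,r} = \frac{1}{1 - (1 - x)\zeta_x/\zeta} = \frac{1}{1 + \dfrac{\lambda_x}{\zeta}a'(x/\zeta)}, \tag{21·4}$$

provided that the denominator is not small, or in other words, that $x$ is not near a
critical point.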

Limiting form of $Q_{n,m}$ for large $n$

22. One can now examine the asymptotic behaviour of $Q_{n,m}$ and ultimately that
of $B_n = Q_{n,0}$ by means of the identity (13·5), viz.

$$\sum_{m=r}^{n} Q_{n,m}\frac{(n-r)!\,(\kappa b_m)^{n-m}(1 - \kappa b_m)^{m-r}}{(m-r)!\,(n-m)!} = 1. \tag{13·5}$$

The limiting form of (13·5) depends on the situation of the roots of

$$1 - z - \lambda a(z/\zeta) = 0$$

which lie in the range $x < z \le 1$, where, as defined above,

$$\lambda = \kappa(1 - x), \qquad 0 \le \lambda \le \lambda_x = (1 - x)/a(x/\zeta).$$

The discussion is first confined to what we call the 'non-critical' regions where $z$
is not near a critical point. Suppose, for simplicity, that the $a(w)$ curve is of the type
shown in figure 7 having at most 3 positive real roots. As before, $\lambda_c$ denotes the
critical value of $\lambda$ at which the two largest roots $z_2$, $z_3$ coincide at $z_c$ for fixed $\zeta$,
the smallest root then being $z_{c'}$.



Consider what happens as $x$ changes from 1 to 0, $\zeta$ being fixed, confining attention
to the non-critical regions.

(i) $z_c < x < 1$. When $x$ is in this range there is only one root $z_3$ in $x < z < 1$ and if
neither $z_3 - x$ nor $1 - z_3$ are small, (16·1) may be substituted in (13·5) to give

$$\int Q_n(z)\sqrt{\frac{n(1-x)}{2\pi(z_3 - x)(1 - z_3)}}\,\exp\left\{-\frac{n(1-x)\left[1 + \frac{\lambda}{\zeta}a'(z_3/\zeta)\right]^2(z - z_3)^2}{2(z_3 - x)(1 - z_3)}\right\}[1 + A(z - z_3) + Bn(z - z_3)^3]\,dz = 1 + O(n^{-1}) \tag{22·1}$$

where $Q_n(z)$ is written for $Q_{n,m}$.




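On the other hand, when $z_3 - x$ is small so that $m - r = O(1)$ and $\lambda = \lambda_x(1 - \mu/n)$
with $\mu = O(1)$ the appropriate limiting form is, from (21·3),

$$\sum_{m=r} Q_{n,m}\frac{\{(1 - x)[\mu + (m - r)\zeta_x/\zeta]\}^{m-r}}{(m - r)!}\,e^{-(1-x)[\mu + (m-r)\zeta_x/\zeta]} = 1 + O(n^{-1}) \tag{22·2}$$

where $\zeta_x = -a'(x/\zeta)/a(x/\zeta)$.

It is now proved that when $z_c < z < 1$ and $z - z_c = O(1)$,

$$Q_n(z) = \left\{1 + \frac{(1 - z)\,a'(z/\zeta)}{\zeta\,a(z/\zeta)}\right\}(1 + O(n^{-1})). \tag{22·3}$$

The values $\mu = 0$ and $\mu = \zeta_x/\zeta$ in (22·2) give respectively

$$\sum_{m=r} Q_{n,m}\frac{\{(1 - x)(m - r)\zeta_x/\zeta\}^{m-r}}{(m - r)!}\,e^{-(1-x)(m-r)\zeta_x/\zeta} = 1 + O(n^{-1}),$$

$$\sum_{m=r} Q_{n,m}\frac{\{(1 - x)(m - r + 1)\zeta_x/\zeta\}^{m-r}}{(m - r)!}\,e^{-(1-x)(m-r+1)\zeta_x/\zeta} = 1 + O(n^{-1}).$$

Writing $r + 1$ for $r$ and multiplying through by $(1 - x)\zeta_x/\zeta$ the latter becomes

$$\sum_{m=r+1} Q_{n,m}\frac{\{(1 - x)(m - r)\zeta_x/\zeta\}^{m-r}}{(m - r)!}\,e^{-(1-x)(m-r)\zeta_x/\zeta} = (1 - x)\frac{\zeta_x}{\zeta}(1 + O(n^{-1}))$$

whence, on subtraction,

$$Q_{n,r} = Q_n(x) = \{1 - (1 - x)\zeta_x/\zeta\}(1 + O(n^{-1}))$$

which is equivalent to (22·3). The proof can be made rigorous by inserting the
approximation factor $\exp A(m - r)^2/n$ in place of $1 + O(n^{-1})$ and confining the
summation to the range in which this factor does not begin to dominate, the remainder
being demonstrably negligible.

The same expression for $Q_n(z)$ is seen to satisfy (22·1), as it should, for then,
to $O(n^{-1})$,

$$Q_n(z_3) = 1 + \frac{(1 - z_3)\,a'(z_3/\zeta)}{\zeta\,a(z_3/\zeta)} = 1 + \frac{\lambda}{\zeta}a'(z_3/\zeta)$$

and expanding $Q_n(z)$ in (22·1) as $Q_n(z_3) + Q'_n(z_3)(z - z_3) + O(z - z_3)^2$, the first term
integrates to 1, the odd powers of $z - z_3$ vanish on integration, and $O(z - z_3)^2 = O(n^{-1})$
in the effective range of $z - z_3$.

It also follows that over the important range of $m$ in (22·2), $Q_{n,m}$ differs from
$Q_n(x)$ only by $O(n^{-1})$ and on substituting this value for $Q_{n,m}$ the result (21·4) is
verified.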

(ii) $z_{c'} < x < z_c$. Avoiding for the present the critical region near $z_c$, we next discuss
the range between $z_{c'}$ and $z_c$. There are two cases typified by the values $x'$ and $x''$
in figure 7, both corresponding to $\lambda = \lambda_x$ as shown.





Consider first the case of $x'$. When $\lambda < \lambda_x$ the only root greater than $x'$ is $z_3$, which
is also greater than $x$, and (22·1) holds with $x'$ for $x$. On making $\lambda = \lambda_x$, however,
the roots are $x'$ and $x$, and (13·5) becomes

$$Q_{n,r'}T_{n,r',r'} + Q_{n,r'+1}T_{n,r'+1,r'} + \dots + \int Q_n(z)\,dz\,\sqrt{\frac{n(1 - x')}{2\pi(x - x')(1 - x)}}\,\exp\left\{-\frac{n(1 - x')\left[1 + \frac{\lambda}{\zeta}a'(x/\zeta)\right]^2(z - x)^2}{2(x - x')(1 - x)}\right\}[1 + A(z - x) + Bn(z - x)^3] = 1 + O(n^{-1}), \tag{22·4}$$

where $r'/n = x'$, and it is sufficient to note that $T_{n,r',r'} = 1 + O(n^{-1})$.

The integral reduces to $1 + O(n^{-1})$ by virtue of (22·3), and so

$$Q_{n,r'} = Q_n(x') = O(n^{-1}). \tag{22·5}$$

In the case of $x''$, when we make $\lambda = \lambda_x$ there are three roots at $x''$, $x'$ and $x$; then
(13·5) takes a form similar to (22·4) but with $r''$ for $r'$, and two integrals in the
neighbourhoods of $x'$ and $x$ respectively. The first integral vanishes by (22·5), the
second is $1 + O(n^{-1})$, and as before $Q_{n,r''} = Q_n(x'') = O(n^{-1})$.

When $x'' - x'$ is small the discussion is a little more elaborate but the same result
is true. Hence for any value of $z$ in the non-critical region between $z_{c'}$ and $z_c$,

$$Q_n(z) = O(n^{-1}). \tag{22·6}$$


(iii) $0 < x < z_{c'}$. As $x''$ passes through the value $z_{c'}$ into the range $0 < x'' < z_{c'}$, the
other roots $x'$, $x$ for $\lambda = \lambda_x$ coalesce and become complex, so by the previous
argument $Q_n(z)$ again satisfies (22·3).

The conclusions are therefore summarized as follows. Let $a(w)$ be such that
$1 - z - \lambda a(z/\zeta) = 0$ has at most 3 positive real roots, and let $\zeta$ be chosen so that when
$z_2$ and $z_3$ are made to coalesce at $z_c$ for suitable $\lambda = \lambda_c$ the smallest root $z_1$ has a value
$z_{c'} > 0$. Then the approximate form of $Q_n(z)$ in the non-critical regions of $z$ is

$$Q_n(z) = \begin{cases}\left\{1 + \dfrac{(1 - z)\,a'(z/\zeta)}{\zeta\,a(z/\zeta)}\right\}(1 + O(n^{-1})) & \text{in } z_c < z < 1,\\[2ex] O(n^{-1}) & \text{in } z_{c'} < z < z_c,\\[1ex] \left\{1 + \dfrac{(1 - z)\,a'(z/\zeta)}{\zeta\,a(z/\zeta)}\right\}(1 + O(n^{-1})) & \text{in } 0 < z < z_{c'}.\end{cases} \tag{22·7}$$

By varying $\zeta$ the range $z_{c'} < z < z_c$ may be made to vanish or to include $z = 0$ but
in all cases (22·6) holds throughout any part of this range, and (22·3) otherwise
(excluding values of $z$ near $z_{c'}$ or $z_c$). The argument can evidently be extended to
forms of $a(w)$ where the maximum number of roots is greater than 3.

23. The result just obtained has an immediate application to $B_n = Q_{n,0}$. Let
$\lambda = 1$ so that the lowest root $z_1 = 0$ and allow $\zeta$ to vary, the critical value $\zeta_\tau$
corresponding to the double root $z_\tau$. Suppose $a(w)$ to have the form shown in figure 8.





The remaining non-zero roots $z_2$, $z_3$ are real or complex according as $\zeta < \zeta_\tau$ or
$\zeta > \zeta_\tau$. Putting $z = 0$, $a(0) = 1$, $a'(0) = -\zeta_0$ in (22·7), it can therefore be stated that

$$B_n = Q_{n,0} = \begin{cases} O(n^{-1}) & \text{when } \zeta < \zeta_\tau,\\ (1 - \zeta_0/\zeta)(1 + O(n^{-1})) & \text{when } \zeta > \zeta_\tau,\end{cases} \tag{23·1}$$

provided $|\zeta - \zeta_\tau| = O(1)$.

Figure 9 illustrates the special case when $\zeta_\tau = \zeta_0$ and there is only one non-zero
root.

The fact that $B_n \sim 1 - \zeta_0/\zeta$ when $\zeta > \zeta_\tau$ is perhaps not unexpected in view of the
similar exact expression for $B_n$ (12·2) which holds for a form of $a(w)$ to which the
present more general case approximates near $w = 0$. But the result (23·1) sheds an
interesting light on the argument and conclusions reached about large bundles in
the early part of the paper (§§ 4, 5, 6). It was deduced there that all very large bundles
of a given size $n$ would break at loads approximating to $n\zeta_\tau$ in the present notation,
provided that a unique value of $\zeta_\tau$ exists. Evidently this is not the whole truth
of the matter. If the chance $1 - b(s)$ of a thread exceeding $s$ in strength tends to zero
at a rate proportional to $1/s$, then, however large the bundle may be, its strength
$S = n\zeta$ can take values ranging from $n\zeta_\tau$ to $\infty$ with probabilities according to (23·1).
In such cases (23·1) provides all the information necessary about $B_n$ for large $n$,
since the small excluded region near $\zeta_\tau$ is negligible compared with the wide range
over which $B_n$ varies slowly with $\zeta$. In practice, however, it is generally found that
$1 - b(s)$ tends to zero more rapidly than $1/s$ so that $\zeta_0 = 0$, and (23·1) then gives no
information about $B_n$ in the region where it differs appreciably from 0 or 1.

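24. Finally the behaviour of $Q_{n,m}$ in the critical region is discussed. Keeping
$\lambda = 1$, $x = 0$, let $\zeta$ now approach $\zeta_\tau$. The two largest roots $z_2$, $z_3$ come into the
vicinity of $z_\tau$, and on substituting (18·7) and (20·2) the appropriate limiting form of
(13·5) is found to be

$$\sum_{m=0} Q_{n,m}\frac{(m\zeta_0/\zeta)^m}{m!}e^{-m\zeta_0/\zeta} + \int Q_n(z)\sqrt{\frac{n}{2\pi z_\tau(1 - z_\tau)}}\,\exp\left(-\frac{n}{2z_\tau(1 - z_\tau)}\left[\frac{a''_\tau}{2\zeta_\tau^2}(z - z_\tau)^2 + \frac{z_\tau}{\zeta_\tau}(\zeta - \zeta_\tau)\right]^2\right)dz = 1 + O(n^{-1/4}) \tag{24·1}$$

where

$$\zeta - \zeta_\tau = O(n^{-1/2}).$$

The most commonly occurring type of thread population has $\zeta_0 = 0$ and (24·1)
then reduces to

$$B_n = 1 - \int Q_n(z)\sqrt{\frac{n}{2\pi z_\tau(1 - z_\tau)}}\,\exp\left(-\frac{n}{2z_\tau(1 - z_\tau)}\left[\frac{a''_\tau}{2\zeta_\tau^2}(z - z_\tau)^2 + \frac{z_\tau}{\zeta_\tau}(\zeta - \zeta_\tau)\right]^2\right)dz + O(n^{-1/4}). \tag{24·2}$$

When $\zeta_0 \neq 0$ it can be shown by an argument similar to that used to prove (22·3)
that the effect is to multiply the right-hand side of this equation by a factor $1 - \zeta_0/\zeta$.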




The behaviour of $B_n$ for values of $\zeta$ satisfying $\zeta - \zeta_\tau = O(n^{-1/2})$ is thus seen to
depend on that of $Q_n(z)$ over a range of $z$ such that $z - z_\tau = O(n^{-1/4})$. Therefore consider
the limiting form of (13·5) when $x$ is in the critical region and $z_3 - z_2 = O(n^{-1/4})$. It is
more convenient for the moment to work in terms of $z_c$. Suppose first that $z_c < x$,
and now make use of the result (19·2) that near $z_3$, $T_n(z, x)$ takes the form appropriate
to an isolated simple root, but to the order $O(n^{-1/8})$ only. Since $z_3$ is the only root
in $x < z < 1$, (13·5) becomes

$$\int Q_n(z)\sqrt{\frac{n}{2\pi(z_3 - x)}}\,\exp\left[-\frac{n\lambda_c^2 a''^2_c (z_3 - z_c)^2(z - z_3)^2}{2\zeta^4(z_3 - x)}\right][1 + An^{1/4}(z - z_3) + Bn(z - z_3)^3]\,dz = 1 + O(n^{-1/4}) \tag{24·3}$$


\ rr 

and this is satisfied by Q n (z) = (z-z c ) [1 + 0(w*)] (24-4) 

which is the same as (22-3) to 0(n~^). Such an argument does not establish (24-4) 
as the unique solution of (24-3), but it is possible to construct a direct rigorous proof 
on the lines of § 22 (i), though the details are tedious. 

By taking a; <z c ,z c —x — the behaviour of Q n {z) when z<z c ,z c —z = 0(n ~*) 

may be studied by a similar extension of § 22 (ii), and it is found that 

Q n {z) = 0(n~i) (24-5) 

where z<z c , z c -z=0(n~i). 


The asymptotic form of B n 

25. Applying these formulae to (24-2) one writes to 0(n~*) 

A c =l, z c = z r; AX/S 2 - <!&■ 

The contribution to the integral in (24-2) from points in a range of z—z T smaller than 
0(n~i) can be shown to be negligible to 0(n~*). and it is concluded that 




which reduces on setting 


Z r„ 


i y= 2g {z ~ ZT)2+ i^~ u 

That is, the probability distribution of $\zeta$ tends to the normal form for large $n$
with expectation $\zeta_\tau$ and standard deviation


to 


B, 


When $\zeta_0$ is not zero the formula for $B_n$ is simply multiplied by $1 - \zeta_0/\zeta$.




The final result restated in terms of the earlier notation is therefore as follows.
If all threads have the same load-extension curve, and the chance $b(s)$ of a thread
breaking under load $s$ is such that $1 - b(s)$ tends to zero faster than $1/s$, then the
strength $S$ of a bundle of $n$ threads has a probability distribution which tends for
large $n$ to the normal form with expectation

$$S_\tau = ns_\tau[1 - b(s_\tau)]$$

and standard deviation $\sigma = s_\tau\sqrt{\{nb(s_\tau)[1 - b(s_\tau)]\}}$,

where $s_\tau$ gives $s[1 - b(s)]$ its greatest value.
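The stated normal limit can be compared with a direct simulation. The sketch below rests on assumptions that should be made explicit: it takes the usual equal-load-sharing model, in which a bundle whose sorted thread strengths are $x_{(1)} \le \dots \le x_{(n)}$ has strength $\max_k\,(n - k + 1)x_{(k)}$, and it chooses unit-exponential thread strengths, $b(s) = 1 - e^{-s}$, purely for illustration, so that $s[1 - b(s)] = se^{-s}$ is greatest at $s_\tau = 1$. The function names are hypothetical.

```python
import math
import random
import statistics


def bundle_strength(strengths):
    # Largest total load the bundle can carry when surviving threads share it equally.
    xs = sorted(strengths)
    n = len(xs)
    return max((n - k) * x for k, x in enumerate(xs))


def simulate(n, trials, seed=0):
    # Thread strengths drawn from the assumed distribution b(s) = 1 - exp(-s).
    rng = random.Random(seed)
    return [bundle_strength([rng.expovariate(1.0) for _ in range(n)])
            for _ in range(trials)]


def limiting_parameters(n, s_tau=1.0):
    # Expectation n*s_tau*[1 - b(s_tau)] and s.d. s_tau*sqrt(n*b*(1 - b)).
    b = 1.0 - math.exp(-s_tau)
    return n * s_tau * (1.0 - b), s_tau * math.sqrt(n * b * (1.0 - b))


if __name__ == "__main__":
    n = 2000
    sample = simulate(n, trials=200)
    print("simulated:", statistics.mean(sample), statistics.pstdev(sample))
    print("limiting: ", limiting_parameters(n))
```

At finite $n$ the agreement is only approximate, as is to be expected of an asymptotic result; increasing `n` should bring the relative difference in the means down.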

It is worth remarking that the same result would follow at once if one could assume 
that for sufficiently large n the probability distribution of breaking extension is 
independent of that of the number of threads surviving up to the point of rupture, 
the probability of this number being then distributed according to a simple binomial 
law. Subsequent work on breaking extension suggests that the assumption is in fact 
true, but there seems to be no a priori justification for it. 

Note also that I have assumed the dispersion of $x$ to be negligible compared to
that of $\zeta$, but when $n$ is large it may become of comparable order, in which case
the formulae are not strictly valid.

Part II of this paper will contain a generalization to threads having varying load-
extension curves, a discussion of the probability distribution of extension at break,
and a further investigation into the asymptotic form of $B_n$ by a more powerful
saddle-point method based on the special properties of $B_n$ when $b(s) = e^{-\kappa/s}$.

My thanks are due to the Director and Council of the Wool Industries Research 
Association for permission to publish this work. 


References 

Peirce, F. T. 1926 J. Textile Inst. 17, T 355. (See also preceding papers of the series, with
O. Midgley.)

Weibull, W. 1939 IngenVetenskAkad. Handl. no. 153 (in English).



Stationary electron swarms in electromagnetic fields 

By D. Gabor, Dr.Ing., F.Inst.P. 

(Communicated by S. R. Milner, F.R.S. Received 12 June 1944)


Electron clouds rotating in axially symmetric magnetic fields have been known for a long 
time, but the agreement between theory and experiment is still very unsatisfactory. The 
discrepancy appears to be due to the interaction of electrons. Before approaching this 
difficult problem it is desirable to possess a more complete theory of stationary swarms 
without interaction. In the present paper the distribution density is calculated on the basis 
of classical statistical mechanics. It is shown that electrons injected at any point with very 
small initial velocities will distribute themselves with a density inversely proportional to 
the distance from the axis, in a certain annular space. Only the limits of this space, not the 
distribution inside it, will be dependent on the electric or magnetic fields. The uniform or 
nearly uniform distributions calculated by previous authors are singular solutions, incon¬ 
sistent with any degree of statistical disorder. Other laws of density distribution can be 
realized by simultaneous injection of electrons at several points. These offer a possibility to 
realize dispersing electron lenses and corrected electron optical systems. It is shown that the 
ring current produced by the rotating electron cloud can reduce the magnetic field at the 
axis very considerably in devices of practicable dimensions. It appears also possible to 
produce clouds of free electrons with densities sufficient for observable optical effects. 

* 

The theory of the stationary motion of electron assemblies in electric and magnetic
fields is of interest in two fields of applied electronics. It arises in the theory of the
magnetron, an electronic device which so far has defied all attempts at a quantitative
theoretical explanation. The second field is electron optics, which suffers from the
fundamental limitation that lens correction is impossible so long as only electromagnetic
fields free of space charge are employed. Hence the application of electron
clouds offers one of the very few prospects for further development of electron optical
devices, especially of the electron microscope.

The full-anode magnetron, a diode with a cylindrical anode and a coaxial filament
as cathode, was first constructed and investigated by A. W. Hull in 1920, and has
been explored theoretically and experimentally by numerous authors.* An especially
thorough theoretical study is due to L. Brillouin (1941, 1942). These investigations
have revealed a striking discrepancy between theory and the experimental results.†
Hull's simple theory led to the conclusion that current could flow to the anode only
above a certain critical voltage, which increases quadratically with the magnetic
field intensity. Below this voltage the current ought to be cut off, and an electron
cloud of uniform charge density

$$\rho_H = \frac{eH^2}{8\pi mc^2} \tag{1}$$

should rotate in the magnetron like a solid body, with an angular velocity

$$\omega_H = \frac{eH}{2mc}, \tag{2}$$

* A. W. Hull (1921, 1924). F. B. Pidduck (1936). Extensive bibliography in A. F. Harvey
(1943).

† See especially Harvey (1943, pp. 83-113) and also the Foreword by E. B. Moullin.


which is one-half of the 'Larmor frequency'. Though the experiments confirmed
the existence of a critical voltage, the quantitative agreement was very unsatisfactory.
Current starts flowing considerably below the critical voltage and reaches
its saturation value only very gradually, whereas on Hull's theory, even making an
allowance for the initial velocities of the electrons, the rise ought to be confined
within a few tenths of a volt. Though later workers added some refinements to Hull's
theory, they were unable to account for the width and shape of the cut-off characteristic.
An exception is an investigation by E. G. Linder (1938 a, b), who found experimentally
that in a rotating electron cloud random motion develops with a very high
electron temperature, of the order of 10 V. On this basis he could account for the
width of the cut-off curves.
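To give a sense of the magnitudes involved in Hull's picture, the following sketch evaluates the solid-body angular velocity $eH/2mc$ in Gaussian units, together with a uniform charge density taken here as $eH^2/8\pi mc^2$. The second expression is the standard form for this state and is an assumption of the sketch; so are the rounded physical constants and the function names.

```python
import math

# Rounded Gaussian-unit constants (assumed values for illustration).
E_CHARGE = 4.80320e-10   # electron charge, esu
E_MASS = 9.10938e-28     # electron mass, g
C_LIGHT = 2.99792e10     # velocity of light, cm/s


def hull_angular_velocity(H):
    """Angular velocity eH/(2mc) in rad/s for a field H in gauss."""
    return E_CHARGE * H / (2.0 * E_MASS * C_LIGHT)


def hull_charge_density(H):
    """Uniform charge density eH^2/(8 pi m c^2) in esu/cm^3 (assumed form)."""
    return E_CHARGE * H ** 2 / (8.0 * math.pi * E_MASS * C_LIGHT ** 2)


if __name__ == "__main__":
    H = 1000.0  # gauss
    omega = hull_angular_velocity(H)
    rho = hull_charge_density(H)
    print("angular velocity (rad/s):", omega)
    print("electrons per cm^3:", rho / E_CHARGE)
    # With this density, 4*pi*rho*e/m equals twice the squared angular velocity.
    print(4.0 * math.pi * rho * E_CHARGE / E_MASS, 2.0 * omega ** 2)
```

The last line checks the internal consistency of the two expressions: for a cloud rotating as a solid body at this angular velocity, the space-charge term $4\pi\rho e/m$ is twice the square of the angular velocity.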

Several objections could be raised against Linder’s results. He finds by probe 
measurements a particularly high electron temperature when the current is entirely 
cut off, which is a thermodynamical impossibility. His method of calculating the 
current from an electron cloud to a plate is also open to objection, as he uses a formula 
which is strictly valid only in the absence of a magnetic field. Nevertheless, it appears 
very likely that Linder has correctly located the root of the discrepancy. The new 
difficulty arises, however, that theory cannot account for the experimentally found 
rapid development of random motion in an electron cloud. This phenomenon is 
closely related to the inexplicably rapid establishment of electron temperature in 
gas discharges at very low pressures, which was discovered by I. Langmuir and 
H. Mott-Smith, and discussed in detail by Langmuir (1927, 1928). As the problem 
of electron interaction is one of extraordinary mathematical difficulty, it appears 
desirable to obtain first a reliable experimental estimate of the effect. The first step 
towards this must be a more complete investigation of the problem without inter¬ 
action. It will be shown that no satisfactory basis for this has existed up to now, as 
the corrections which have to be applied to Hull’s results are very much larger than 
those found by any of the later authors who tried to improve on them. 

Hull’s results (1) and (2) can be derived from the condition of zero radial current. 
In general a zero current would be the resultant of two equal and opposite currents, 
one flowing inwards, the other outwards. If the electrons start from the cathode with 
zero initial velocity, as assumed in the simple theory, reversal of the sense of motion 
can take place only at the outer boundary of the electron cloud. By formulating the 
condition of equal and opposite currents, combining this with the dynamical equa¬ 
tions and substituting it into Poisson’s equation, a differential equation for the 
space-charge distribution is obtained.* The remarkable result follows, that this 
equation has a solution free of singularities only in the case when both the opposite 
currents are zero, that is to say, if there is no radial electron motion at all. This means 
that one must imagine the electrons circling around the cathode in coaxial circular 
orbits. The question immediately arises how such a state of motion could ever 
establish itself. Some authors have thought that it could be justified, as they found 
that small radial velocities produce only insignificant deviations from Hull’s law of 

* Cf. e.g. Brillouin (1941). 




distribution. It will be shown, however, that any initial velocities, however slight,
require a fundamental modification of the picture, and that Hull's results must be
considered as a singular solution of the problem, which could neither establish nor
maintain itself.


This is of great importance from the point of view of the other field mentioned at
the beginning, the electron optical applications of space charges. Electron optical
systems cannot be corrected at present for spherical aberration and other defects
because no dispersing electron lenses can be realized without (negative) space
charges. A space charge according to equation (1) would be useless for this purpose.
As explained above, this distribution satisfies the condition that electrons are everywhere
in radial equilibrium. Hence, if an electron beam is shot through such an
electron cloud, the concentrating effect of the magnetic field would exactly balance
the dispersing effect of the space charge, and the resulting lens effect would be nil.
But, as it will be shown that stationary electron clouds are possible in which radial
equilibrium exists only in the average, but not at every radius, the prospect of
electron optical applications is not closed. The problem will therefore be formulated
with sufficient generality to cover both the magnetron and the electron optical
applications.


The general method to be followed will be to treat the problem as one of statistical
mechanics. First the distribution law will be derived on the basis of classical statistics,
giving the density as a function of the position and of the electromagnetic potentials.
The second step will be to substitute the density into Poisson's equation and find the
field consistent with it, as a function of position.
Fortunately, the first step leads to an extremely simple law, so that the second step
and the discussion of the solution will present no difficulty.


1. THE DYNAMICAL PROBLEM

Consider an axially symmetrical electromagnetic field, specified in cylindrical
co-ordinates $r$, $z$ by an electric potential $\phi(r, z)$ and a vector potential
$A(r, z)$. As the vector potential has only a tangential component, it will not be
necessary to write it as a vector, or to distinguish it by a suffix $t$ like the other tangential
components. The units are Gaussian. The magnetic field follows from $A$ by the
relation

$$H_r = -\frac{\partial A}{\partial z}, \qquad H_z = \frac{1}{r}\frac{\partial (rA)}{\partial r}. \tag{3}$$

In the special case of a homogeneous magnetic field, $H_r = 0$, $H_z = H_0$,

$$A = \tfrac{1}{2}H_0 r + C/r, \tag{4}$$

Ab “ does not ^ ^ ° 




illustrates an example. The cathode is placed in, or near to, a position of zero magnetic 
flux. In the case illustrated, the field at the axis has the law 

H(0,z) — const, (l + tan -1 ^), 

so that at great distance from the cathode, in either direction, the field becomes
homogeneous, the field at the right being three times stronger than the field at the
left, which plays only the part of an auxiliary, with the purpose of adjusting $A_0$ to
a suitable (small) value. The magnetic field is illustrated by the field lines, i.e. the
meridians of the tubes of constant flux $\Psi = 2\pi rA = \text{const}$. The flux increases by
equal steps from one line to the next. The field line passing through the cathode is
shown in dotted lines. It approaches a radius $r_0$ at the far right, where the field
becomes homogeneous. This arrangement is suitable for electron optical applications,
as it leaves the axis free. The cathode has to be placed at or near the line $\Psi = 0$, as
otherwise the electrons could not get near the axis.


Figure 1. Example of magnetic field. (Upper panel: lines of constant magnetic flux, $\Psi = \text{const.}$, with the line $\Psi = 0$ marked; lower panel: magnetic intensity on the axis.)


The dynamical equations are most easily formulated in the Hamiltonian form. The linear momentum of the electron in an electromagnetic field is defined as

$$\mathbf{p} = m\mathbf{v} - \frac{e}{c}\mathbf{A}. \tag{5}$$

In an electromagnetic field of rotational symmetry it is advantageous to write the components of the linear momentum $p_z$, $p_r$ and $p_\theta/r$, where $p_\theta$ is the angular momentum. The Hamiltonian is

$$\mathcal{H} = \frac{1}{2m}\left[p_z^2 + p_r^2 + \left(\frac{p_\theta}{r} + \frac{e}{c}A\right)^2\right] - e\phi. \tag{6}$$

The potential $\phi$ is to be measured from the cathode as zero level.

* R. Becker (1933, p. 96). The equation (5) is due to K. Schwarzschild (1903).




D. Gabor 


The problem has two integrals, $\mathcal{H}$ = const. and $p_\theta$ = const. Both can be immediately obtained from the canonical equations. Marking values relating to the cathode with a suffix 0, the momentum integral is written

$$p_\theta = r\left(mv_t - \frac{e}{c}A\right) = a\left(mv_{t0} - \frac{e}{c}A_0\right). \tag{7}$$


From this point on it will be convenient to measure the velocity in potential units. Therefore $v$ is replaced by

$$u = \sqrt{\frac{m}{2e}}\,v. \tag{8}$$

It will also be convenient to introduce a quantity, which could be called the vector potential relative to the cathode, defined by

$$\mathcal{A} = \sqrt{\frac{e}{2mc^2}}\left(A - \frac{a}{r}A_0\right). \tag{9}$$

$r\mathcal{A}$ is a measure of the flux which passes between the cathode $(a, z_0)$ and the circle $(r, z)$. $\mathcal{A}$ has the same dimensions as $u$. With these new symbols equation (7) becomes

$$ru_t - au_{t0} = r\mathcal{A}. \tag{7.1}$$

The energy integral is, in the new units,

$$u_z^2 + u_r^2 + u_t^2 - \phi = (u_z^2 + u_r^2 + u_t^2)_0. \tag{10}$$

A condition which electrons have to fulfil in order to have access to an element in phase space can now be formulated by eliminating $u_{t0}$ between the equations (7.1) and (10). It will be convenient to introduce a dimensionless parameter

$$x = r/a \tag{11}$$

for the radius, i.e. to measure $r$ in units $a$. Thus, since $u_{t0}^2$ cannot exceed $u_0^2 = (u_z^2 + u_r^2 + u_t^2)_0$,

$$Q = u_z^2 + u_r^2 - (x^2 - 1)u_t^2 + 2x^2\mathcal{A}u_t - x^2\mathcal{A}^2 - \phi \ge 0. \tag{12}$$

This is the criterion of accessibility. Its geometrical interpretation in velocity space is shown in figure 2. $Q = 0$ is a quadric surface of rotational symmetry relative to the $u_t$ axis. Its type is given by the following table:

                                              x < 1                  x > 1
    $\phi > x^2\mathcal{A}^2/(x^2 - 1)$       real ellipsoid         one-shell hyperboloid
    $\phi < x^2\mathcal{A}^2/(x^2 - 1)$       imaginary ellipsoid    two-shell hyperboloid
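The classification in the table can be written as a small lookup. The following is only an illustrative sketch: the function name and the plain-float interface are assumptions, and `A` stands for the relative vector potential $\mathcal{A}$ in potential units.

```python
def quadric_type(x: float, phi: float, A: float) -> str:
    """Classify the surface Q = 0 at reduced radius x = r/a (x != 1).

    phi is the potential and A the relative vector potential, both in the
    potential units of the text. Names are illustrative, not the author's.
    """
    kappa = x * x / (x * x - 1.0)      # x^2 / (x^2 - 1); negative for x < 1
    above = phi > kappa * A * A        # selects the row of the table
    if x < 1.0:
        return "real ellipsoid" if above else "imaginary ellipsoid"
    return "one-shell hyperboloid" if above else "two-shell hyperboloid"


if __name__ == "__main__":
    for args in [(0.5, 1.0, 1.0), (0.5, -1.0, 1.0), (2.0, 2.0, 1.0), (2.0, 1.0, 1.0)]:
        print(args, quadric_type(*args))
```

The case $x = 1$ is excluded because the factor $x^2/(x^2 - 1)$ is not defined there.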


This is illustrated in figure 3 in the special case of a homogeneous magnetic field, in which

$$\mathcal{A} = K(x - 1/x), \tag{13}$$

where $K = a\sqrt{eH^2/8mc^2}$.
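As a check on the homogeneous-field case, the relative vector potential can be evaluated directly from $A = \tfrac12 Hr$ and compared with the closed form $K(x - 1/x)$. This is a sketch under stated assumptions: the relative potential is taken as $\sqrt{e/2mc^2}\,(A - aA_0/r)$, the constant $\sqrt{e/2mc^2}$ is represented by a free parameter `s`, and all function names are hypothetical.

```python
def rel_vector_potential(r: float, a: float, H: float, s: float = 1.0) -> float:
    """s * (A(r) - (a/r) * A(a)) for the homogeneous field A(r) = H*r/2.

    s stands for sqrt(e / (2 m c^2)); it is left as a parameter so that the
    check does not depend on a choice of units.
    """
    A_r = 0.5 * H * r
    A_cathode = 0.5 * H * a
    return s * (A_r - (a / r) * A_cathode)


def closed_form(r: float, a: float, H: float, s: float = 1.0) -> float:
    """K * (x - 1/x) with K = a * sqrt(e H^2 / (8 m c^2)) = a * s * H / 2."""
    K = a * s * H / 2.0
    x = r / a
    return K * (x - 1.0 / x)


if __name__ == "__main__":
    for r in (0.5, 1.0, 2.0, 7.5):
        print(r, rel_vector_potential(r, 1.5, 3.0), closed_form(r, 1.5, 3.0))
```

Both expressions vanish at the cathode radius $r = a$ and change sign there.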




Now consider electrons which have left the cathode in the velocity interval $u_0$, $u_0 + du_0$. They will arrive at some point $(z, r)$ with a velocity $u$, and in an interval $du$, connected with the initial data by the relations

$$u^2 = u_0^2 + \phi, \qquad u\,du = u_0\,du_0. \tag{14}$$

Combining this with equation (12) it is seen that electrons which have access to $(z, r)$ will all be contained in that segment of the spherical shell with radii $u$, $u + du$ which is (seen from the axis) outside the quadric surface $Q = 0$. This shell segment is the accessible volume in velocity space. It may be noted that it gives also a measure of the volume of momentum space, as, by equation (5),

$$dp_z\,dp_r\,d(p_\theta/r) = m^3\,dv_z\,dv_r\,dv_t. \tag{15}$$



Figure 2. Construction of the accessible volume of momentum space.

Figure 3. The quadric surface Q.

2. The law of velocity distribution 

By Liouville’s theorem the density $D$ of a group of electrons in phase space is the same as the original density which this group had at the cathode. Hence if it were known that the whole accessible volume of momentum (or velocity) space calculated in the previous section would be actually filled with electrons, then the velocity distribution at any point would also be known, and from this the density in configuration space could be calculated. This condition is fulfilled in an important special case, in that of the infinitely long magnetron with cylindrical cathode, the only one considered by previous authors. In this case there is no further limitation of the accessible volume, as it is easy to see that a trajectory continued backwards from any point of it must actually reach the cylindrical cathode at some point.




In the general case Liouville’s theorem by itself is not sufficient to determine the velocity distribution, for two reasons. First, in the case of limited cathodes in general, only a part of the accessible volume will be really filled. This difficulty arises especially in the case of an ‘ordered’ flow of electrons. Hull’s distribution with electrons circling in circular orbits is an extreme example of such an ‘ordered’ flow. But this uncertainty will exist also in all cases of motion in ‘short trajectories’, i.e. if the electrons return to the cathode after spending only a short time in the cloud. There is, however, also a second sort of difficulty, which arises especially in the case of ‘long trajectories’. The same group of electrons might return (and by the theorem of Poincaré and Zermelo will return) an indefinite number of times to the same element in phase space, if it is allowed to spend a long time in the cloud, and the number of times this will happen is not known. Both difficulties are resolved if this time can be assumed to be very long, by the ergodic theorem (long known as the ergodic hypothesis, until G. D. Birkhoff and J. v. Neumann proved it), according to which the whole accessible phase volume will be filled, with uniform density.* Moreover, this result is independent of the initial conditions, in the sense that though originally (at the cathode) $D$ might have been a function not only of $u_0$ but also of the direction, ultimately, over long intervals it can be a function of $u$ only.

One can therefore apply the law of uniform distribution in the accessible part of 
phase space to both the principal cases under consideration, to the magnetron and 
to the space-charge electron lens, as shown in figure 1, though for different reasons. 
In the first case it is justified by Liouville’s theorem, in the second by the ergodic 
theorem. The assumption of very long trajectories appears well justified in this 
second case, as the cathode can be made very small in comparison to the rest of the 
volume, which the electrons will traverse in general many times until they return 
to the cathode. 

It may again be emphasized that the problem is to be treated dynamically as a 
one-electron problem, and that electron interaction must be expected to make itself 
particularly felt in the case of long trajectories. It may be repeated that the present 
purpose is not to establish a generally valid theory of magnetrons and similar devices, 
but to build a basis for such a theory by first investigating the conclusions of a 
theory without interaction. 


3. The law of space-charge distribution

On the basis of this assumption the volume of the accessible momentum (or velocity) space associated with any point is a measure of the space-charge density at this point. To calculate this, one puts, as in figure 2,

$$u_t = u\cos\alpha, \qquad u_z^2 + u_r^2 = u^2\sin^2\alpha. \tag{16}$$

* It appears that the present application is proof against the weighty arguments which have been advanced against considering the ergodic, or the quasi-ergodic, theorem as the basis of classical statistical mechanics. Cf. R. C. Tolman (1938), p. 70.




Substitution into equation (12) (with the equality sign) gives the following equation for the intersections of the sphere of radius $u$ with the quadric $Q = 0$:

$$(u\cos\alpha - \mathcal{A})^2 = \frac{a^2}{r^2}(u^2 - \phi) = \frac{a^2}{r^2}u_0^2. \tag{17}$$

Physically this means that the extreme electrons reaching a point will be those which started at the cathode in tangential direction. The limits are ($u_0$ being considered always as positive)

$$u\cos\alpha = \mathcal{A} \pm \frac{a}{r}u_0. \tag{17.1}$$


Four cases must now be distinguished, which give four different laws for the space-charge distribution:

(a) The equation (17) has two real roots, $\alpha_1$, $\alpha_2$. The condition for this is

$$\left(\mathcal{A} + \frac{a}{r}u_0\right)^2 < \phi + u_0^2. \tag{a}$$

If this condition is fulfilled (as in figure 2), the volume of the shell segment is, apart from a factor $2\pi$,

$$(\cos\alpha_1 - \cos\alpha_2)\,u^2\,du.$$

Combined with (17.1), which gives

$$u(\cos\alpha_1 - \cos\alpha_2) = 2\frac{a}{r}u_0, \tag{17.2}$$

and taking into consideration that $u\,du = u_0\,du_0$, one obtains for the charge density, apart from a constant factor, which may be included in the phase density $D$,

$$d\rho = \frac{a}{r}D(u_0)\,u_0^2\,du_0. \tag{18}$$


This means that if electrons are emitted only in an infinitesimal energy interval (‘microcanonic assembly’) the density will be inversely proportional to the distance from the axis, independently of the electric and magnetic fields, which determine only the limits inside which it is valid, but do not interfere with the distribution itself. This strikingly simple law holds also if the primary emission is not homogeneous, up to a certain maximum initial velocity. This law is the most important, but for completeness the other cases will also be discussed.

(b) The equation (17) has only one real root. The condition for this is

$$\left(\mathcal{A} - \frac{a}{r}u_0\right)^2 < \phi + u_0^2 < \left(\mathcal{A} + \frac{a}{r}u_0\right)^2. \tag{b}$$

In this case the spherical segment becomes a spherical cap. The accessible volume is

$$(1 - \cos\alpha)\,u^2\,du,$$

and the density will follow the law

$$d\rho = \tfrac12\left[\sqrt{\phi + u_0^2} - \mathcal{A} + \frac{a}{r}u_0\right]D(u_0)\,u_0\,du_0. \tag{19}$$

Unlike (a) this depends explicitly on $\phi$ and $\mathcal{A}$.

If equation (17) has no real roots, this can mean two things: 

(c) The whole volume of the spherical shell is accessible. The law of density is 

$$d\rho = \sqrt{\phi + u_0^2}\;D(u_0)\,u_0\,du_0. \tag{20}$$

This is the case if the whole sphere is outside the quadric, which in this case must 
be an ellipsoid. 

(d) If, on the other hand, the whole sphere is inside the quadric, the accessible 
volume is zero, and so is the density. Therefore in the following this case will be 
referred to as (o). 
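The four cases can be collected in one routine that clips the limits of $u\cos\alpha$ to the sphere of radius $u$. What follows is a minimal sketch, not the author's procedure: the function name is hypothetical, `A` denotes the relative vector potential, and the returned weight is the factor multiplying $D(u_0)\,u_0\,du_0$ after a common factor $4\pi$ is dropped (an assumed normalization chosen so that case (a) reduces to $(a/r)u_0$).

```python
import math


def density_law(phi: float, A: float, x: float, u0: float):
    """Return (law, weight) for given potential phi, relative vector
    potential A, reduced radius x = r/a and initial velocity u0 >= 0.

    weight * D(u0) * u0 * du0 is the density contribution, with a common
    factor 4*pi dropped. Interface and normalization are assumptions.
    """
    u_sq = phi + u0 * u0
    if u_sq <= 0.0:
        return "o", 0.0                       # the point cannot be reached
    u = math.sqrt(u_sq)
    lo = A - u0 / x                           # lower limit of u*cos(alpha)
    hi = A + u0 / x                           # upper limit of u*cos(alpha)
    lo_c, hi_c = max(lo, -u), min(hi, u)      # clip to the sphere |u cos a| <= u
    if lo_c >= hi_c:
        return "o", 0.0                       # sphere wholly inside the quadric
    weight = 0.5 * (hi_c - lo_c)
    if lo >= -u and hi <= u:
        law = "a"                             # shell segment, weight = u0 / x
    elif lo < -u and hi > u:
        law = "c"                             # whole shell, weight = u
    else:
        law = "b"                             # spherical cap
    return law, weight


if __name__ == "__main__":
    print(density_law(4.0, 0.5, 2.0, 1.0))    # law (a)
    print(density_law(1.0, 1.0, 1.0, 0.5))    # law (b)
    print(density_law(0.01, 0.0, 0.5, 1.0))   # law (c)
    print(density_law(0.1, 2.0, 2.0, 0.1))    # law (o)
```

In case (a) the weight is $u_0/x = (a/r)u_0$ whatever the values of the potential and of the relative vector potential, which is the $1/r$ law.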

One must now determine the range of validity of these laws, in terms of $\phi$, $\mathcal{A}$, $u_0$ and $r$ (or $x$). The discussion, though elementary, is rather complicated, and only the results will be given. The velocity ranges in which different solutions (a)-(d) are valid are separated by roots of the equation

$$\left(\mathcal{A} \pm \frac{u_0}{x}\right)^2 = \phi + u_0^2,$$

which are

$$u_{1,2} = \frac{x}{x^2 - 1}\left[\mathcal{A} \pm \sqrt{\mathcal{A}^2 - (x^2 - 1)(\phi - \mathcal{A}^2)}\right]. \tag{21}$$

Of these roots, if they exist, the smaller one is called $u_1$ and the larger one $u_2$. It will also be convenient to introduce the abbreviation

$$\kappa = x^2/(x^2 - 1).$$
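The roots (21) are easy to evaluate numerically. The sketch below assumes the form $u_{1,2} = \frac{x}{x^2-1}\,[\mathcal{A} \pm \sqrt{\mathcal{A}^2 - (x^2-1)(\phi - \mathcal{A}^2)}]$, which solves the boundary equation with the upper sign; the function name and the `None` return for complex roots are illustrative choices.

```python
import math


def boundary_roots(phi: float, A: float, x: float):
    """Roots (u1, u2), u1 <= u2, of (A + u0/x)^2 = phi + u0^2.

    Returns None when the roots are not real or when x == 1, where the
    quadratic degenerates. A is the relative vector potential.
    """
    if x == 1.0:
        return None
    disc = A * A - (x * x - 1.0) * (phi - A * A)
    if disc < 0.0:
        return None
    pref = x / (x * x - 1.0)
    root = math.sqrt(disc)
    u_low, u_high = sorted((pref * (A - root), pref * (A + root)))
    return u_low, u_high


if __name__ == "__main__":
    print(boundary_roots(1.2, 1.0, 2.0))   # two real roots
    print(boundary_roots(3.0, 1.0, 2.0))   # None: phi exceeds kappa * A^2
```

For $x > 1$ the roots cease to be real exactly when $\phi > \kappa\mathcal{A}^2$, in agreement with the role of $\kappa$ as the dividing parameter.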

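The following table gives the intervals of $u_0$ in which the laws (a), (b), (c) and (d) are valid:

    x < 1
    law    $\phi > \mathcal{A}^2$    $\mathcal{A}^2 > \phi > \kappa\mathcal{A}^2$    $\kappa\mathcal{A}^2 > \phi$
    (a)    0 to $u_2$                -                                               -
    (b)    -                         $u_1$ to $u_2$                                  -
    (c)    $u_2$ to $\infty$         $u_2$ to $\infty$                               -
    (o)    -                         0 to $u_1$                                      0 to $\infty$

    x > 1
    law    $\phi > \kappa\mathcal{A}^2$    $\kappa\mathcal{A}^2 > \phi > \mathcal{A}^2$    $\mathcal{A}^2 > \phi$
    (a)    0 to $\infty$                   0 to $u_1$, $u_2$ to $\infty$                   $u_2$ to $\infty$
    (b)    -                               $u_1$ to $u_2$                                  $u_1$ to $u_2$
    (c)    -                               -                                               -
    (o)    -                               -                                               0 to $u_1$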


This table is illustrated in figure 4 in the particular case $a = r_0$, i.e. if the cathode is in the homogeneous part of the field. In this case the curves which divide the field are

$$\phi/K^2 = (x - 1/x)^2, \qquad \phi/K^2 = x^2 - 1,$$

as given by equation (13). In the general case the curve $\phi = \kappa\mathcal{A}^2$ has a different shape, as it goes to $\pm\infty$ at $x = 1$, but the relative position of the two curves remains the same, and nothing essential is changed.

The table discloses the extraordinary variety of equilibrium shapes of rotating electron swarms which could be obtained with large initial velocities. It may be somewhat surprising that the initial velocity adds to the manifoldness of the problem, as one might think that lowering of the cathode potential would be equivalent to an increase of $u_0$. But $u_0$ appears in the equations not only in the combination $\phi + u_0^2$, but also in the combination $au_0/r$. This is a consequence of the momentum integral, and means physically that a velocity $u_0$ at the radius $a$ is ‘worth’ a velocity $au_0/r$ at the radius $r$. Things become simple only at small initial velocities, as in this range there obtains either the simple hyperbolical law (a) or the law (o).



Before discussing law (a) in detail, it may be interesting to compare it with the corresponding law in a two-dimensional magnetron, that is, with an electron swarm in which ergodic disorder would exist only in the two dimensions $r$ and $\theta$, but not in the $z$-direction. This would be the case if the field were absolutely cylindrical, and if the electrons had definite, e.g. zero, initial components $u_{z0}$. In this case the phase volume changes into an area, which is

<*-«*>■* - L 008 7FR) - o ” 8 ' 1 * 

\ 

and series development gives the approximate formula for the density 

p~(a l — a 2 ) ~ ^ + l a ~ j r ~^ 2 + 2au 0 ^/r ]-- ^ 2 -2to 0 ,af/r]). (22-1) 






If $\phi - \mathcal{A}^2 \gg (2a/r)\,u_0\mathcal{A}$, this can be further simplified and gives

$$\rho = \frac{\text{const.}}{r\sqrt{\phi - \mathcal{A}^2}}. \tag{22.2}$$

This is a law entirely different from any of the previous ones. Such sudden changes with the number of degrees of freedom are familiar to classical statistical mechanics and are a warning to be cautious in the application of statistical principles. But in the present case there is good reason to think that the axial degree of freedom can be considered as ‘fully excited’. In the case of the magnetron with cylindrical cathode this follows without any application of statistics from Liouville’s theorem.

' mne6ment “ %uie 1 *™* Wrc * Itod 

It may be mentioned that a further step in the same direction, i.e. restricting the 


4. Self-consistent stationary electron swarms 

The result of the preceding discussion is that for very small initial velocities law (a) will be valid in the region in which $\phi > \mathcal{A}^2$, and the density will be zero outside it. The more complicated laws (b) and (c) come into action only in thin sheaths at the edges of the electron cloud. Apart from these sheaths, the $1/r$ law is therefore valid not only for electrons in an infinitesimal energy interval but also for electrons emitted by thermionic cathodes.

To obtain the potential distribution one must introduce the law into Poisson’s equation


$$\frac{\partial^2\phi}{\partial z^2} + \frac{\partial^2\phi}{\partial r^2} + \frac{1}{r}\frac{\partial\phi}{\partial r} = \frac{C}{r}, \tag{23}$$

where $C$ is a constant, to be determined later. The solution will be discussed only in the cylindrical case $\partial\phi/\partial z = 0$.

The general solution in the cylindrical case is

$$\phi = Cr + c_1 + c_2\log r/r_0, \tag{24}$$

where $c_1$ and $c_2$ are constants, and $r_0$ is as defined in figure 1.

Assume, as a first approximation, sharp boundaries at $r = r_1$ inside and $r = r_2$ outside, at which the velocity is tangential. The boundary conditions are

$$\phi(r_1) = \mathcal{A}^2(r_1), \qquad \phi(r_2) = \mathcal{A}^2(r_2), \tag{25}$$

and in addition at the inner edge

$$d\phi/dr = 0, \tag{26}$$





if there is no charge inside $r_1$. This condition is valid in the case of figure 1 because there is no electrode at the axis, but it is valid also in the case of the cylindrical magnetron, as the field strength at the cathode must be zero in the equilibrium condition. There are therefore three conditions for the five constants $C$, $c_1$, $c_2$, $r_1$ and $r_2$, and two degrees of freedom are left over. One of these corresponds to the anode potential, which can have any value below the critical potential, and the other to the degree of saturation. The total charge in the cloud may assume any value up to a certain maximum. This will be reached if $r_1 = r_0$ and $\phi(r_0) = 0$, which means that no further electrons can reach the cloud from the cathode.

Having assumed $\phi$ independent of $z$, the same assumption must now be made for $\mathcal{A}$. Hence, neglecting the magnetic effect of the rotating electrons, which will be dealt with later, a homogeneous magnetic field must be assumed, writing

$$\mathcal{A}^2 = \frac{eH^2}{8mc^2}\left(r - \frac{r_0^2}{r}\right)^2. \tag{27}$$

Substituting this into the boundary conditions (25) and (26), the following expression is obtained for the space-charge density at a radius $r$:

$$\rho(r) = \tfrac14\rho_H\,\frac{(r_2 - r_0^2/r_2)^2 - (r_1 - r_0^2/r_1)^2}{r\,[r_2 - r_1(1 + \log r_2/r_1)]}, \tag{28}$$

where $\rho_H$ is Hull’s space-charge density, as defined in equation (1). This is the solution of the problem, with two free parameters $r_1$ and $r_2$. It is illustrated in figure 5 in the special case $r_2/r_0 = 3$ for various radii $r_1$, which correspond to various degrees of saturation. The total charge contained between $r_1$ and $r_2$ is marked on the curves in percentages of the maximum charge.
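The density law of the self-consistent cloud can be evaluated for any pair of edge radii. The sketch below assumes the form of equation (28) that follows from the boundary conditions $\phi(r_1) = \mathcal{A}^2(r_1)$, $\phi(r_2) = \mathcal{A}^2(r_2)$, $d\phi/dr = 0$ at $r_1$, with $\mathcal{A}^2$ proportional to $(r - r_0^2/r)^2$; the function name is hypothetical and the density is returned in units of Hull's density $\rho_H$.

```python
import math


def density_ratio(r: float, r0: float, r1: float, r2: float) -> float:
    """rho(r) / rho_H for a cloud between r1 and r2 (cathode radius r0).

    Assumed form: (1/4) * [g(r2) - g(r1)] / (r * [r2 - r1 * (1 + ln(r2/r1))])
    with g(s) = (s - r0^2 / s)^2.
    """
    def g(s: float) -> float:
        return (s - r0 * r0 / s) ** 2

    denom = r * (r2 - r1 * (1.0 + math.log(r2 / r1)))
    return 0.25 * (g(r2) - g(r1)) / denom


if __name__ == "__main__":
    # Maximum saturation (r1 = r0): density at the outer edge for growing r2/r0.
    for ratio in (3.0, 10.0, 100.0, 1000.0):
        print(ratio, density_ratio(ratio, 1.0, 1.0, ratio))
```

At maximum saturation the outer-edge value tends to one-quarter for large $r_2/r_0$, and at fixed edges the product $r\rho(r)$ is constant across the cloud.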



Figure 5. Potential and density distribution.

It may be noted that this result gives an a posteriori justification of our procedure. 
The law (a) has been assumed to hold, that is to say <j> and ^Iso, that apart 

_ . .in . t T I J? J 1 _ *_ 11 . * — 1 waT a ni+i nn 


This assumption is now verified, except at 100 % saturation, where the two curves come rather close together, near the inner edge $r_1$.

In the case of maximum saturation, i.e. $r_1 = r_0$, the density at the outer edge becomes, according to equation (28),

$$\rho(r_2) = \tfrac14\rho_H\,\frac{[1 - (r_0/r_2)^2]^2}{1 - (r_0/r_2) - (r_0/r_2)\log(r_2/r_0)}, \tag{28.1}$$

and for large ratios $r_2/r_0$ this approaches one-quarter of Hull’s value. Applying this to the cut-off characteristic of the cylindrical magnetron an interesting difference arises between the present and previous theories. The sudden jump of current when the critical voltage is exceeded ought to be only about one-quarter of the saturation value, instead of the whole amount. There are, indeed, measured magnetron characteristics which show this phenomenon,* but it would be premature to consider
this as a confirmation of the theory. The present theory cannot account—and no 
one-electron theory possibly could account—for the discrepancy of the voltage at 
which current starts to flow. Also, the present theory is valid only so long as the 
cloud is static, i.e. so long as there is no current. There is the possibility that even 
small currents might modify the calculated distribution very considerably. 

The density at the inner edge $r_1$ in the case of saturation is about $\tfrac14\rho_H\,r_2/r_0$, which can far exceed $\rho_H$. But this is not necessarily the largest density which can be maintained in a magnetron of prescribed outer radius. For very large ratios $r_2/r_1$, the density at the inner edge may be written approximately

$$\rho(r_1) = \tfrac14\rho_H\left[\frac{r_2}{r_1} - \frac{r_0^4}{r_2 r_1^3}\right]. \tag{28.2}$$

This has a maximum at $r_1 = \sqrt{3}\,r_0^2/r_2$, i.e. when $r_1$ is 1.73 times larger than the smallest radius which electrons can reach at all at a given outer radius. The value of the maximum is

$$\rho(r_1)_{\max} = 0.094\,\rho_H(r_2/r_0)^2, \tag{28.3}$$

which can exceed the value of $\rho(r_1)$ at maximum saturation by any amount if $r_2/r_0$ is made sufficiently large. This effect begins to develop at about $r_2/r_0 = 4$. In the example shown in figure 5 the largest density is still reached at maximum saturation.
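The position of the maximum can be checked by a direct scan. The sketch assumes the leading-order form $\rho(r_1)/\rho_H \approx \tfrac14\,[r_2/r_1 - r_0^4/(r_2 r_1^3)]$ for large $r_2/r_1$, obtained from (28) by dropping the logarithmic term; this form and the function names are assumptions of the example. With it the maximum lies at $\sqrt3$ times the smallest accessible radius $r_0^2/r_2$, and the coefficient of $(r_2/r_0)^2$ comes out as $1/(6\sqrt3) \approx 0.096$, close to the value quoted in the text.

```python
import math


def inner_edge_ratio(r1: float, r0: float, r2: float) -> float:
    """Leading-order rho(r1) / rho_H for r2 >> r1 (assumed form)."""
    return 0.25 * (r2 / r1 - r0 ** 4 / (r2 * r1 ** 3))


def scan_maximum(r0: float, r2: float, steps: int = 20000):
    """Grid search over r1 in (r_min, 5 * r_min]; returns (value, r1)."""
    r_min = r0 * r0 / r2                   # smallest radius electrons can reach
    grid = (r_min * (1.0 + 4.0 * k / steps) for k in range(1, steps + 1))
    return max((inner_edge_ratio(r1, r0, r2), r1) for r1 in grid)


if __name__ == "__main__":
    value, r1_best = scan_maximum(r0=1.0, r2=20.0)
    print(r1_best / (1.0 / 20.0))          # close to sqrt(3)
    print(value / 20.0 ** 2)               # close to 1 / (6 * sqrt(3))
```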

In what limits would such an electron cloud act as a dispersing lens? The radial electric gradient is

$$\frac{d\phi}{dr} = C + \frac{c_2}{r} = C\left(1 - \frac{r_1}{r}\right) = -4\pi\rho(r_1)\,r_1\left(1 - \frac{r_1}{r}\right). \tag{29}$$


This must be compared with the gradient in the case of Hull’s distribution, which would just suffice to keep the electrons in equilibrium,

$$\left(\frac{d\phi}{dr}\right)_H = -2\pi\rho_H r.$$

Substituting in equation (29) the maximum value of $\rho(r_1)$ from equation (28.3), the following criterion is obtained for a dispersing lens:

$$\frac{d\phi/dr}{(d\phi/dr)_H} = 0.188\left(\frac{r_2}{r_0}\right)^2\frac{r_1}{r}\left(1 - \frac{r_1}{r}\right) > 1. \tag{30}$$

A dispersing effect will therefore always exist in a certain interval of $r$ so long as $r_2/r_0 > 4.6$. But this effect is confined between two limiting radii, between which it reaches a maximum value.
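The threshold ratio follows from the criterion by elementary arithmetic, as the sketch below shows. It assumes the criterion in the form $0.188\,(r_2/r_0)^2\,(r_1/r)(1 - r_1/r) > 1$; the function names are illustrative. Since $(r_1/r)(1 - r_1/r)$ is at most $\tfrac14$, reached at $r = 2r_1$, the criterion can first be met when $0.188\,(r_2/r_0)^2/4 = 1$.

```python
import math


def lens_ratio(r_over_r1: float, r2_over_r0: float, coeff: float = 0.188) -> float:
    """Ratio of the radial gradient to Hull's gradient (assumed form)."""
    t = 1.0 / r_over_r1
    return coeff * r2_over_r0 ** 2 * t * (1.0 - t)


def threshold(coeff: float = 0.188) -> float:
    """Smallest r2/r0 at which the ratio can reach unity (at r = 2 * r1)."""
    return math.sqrt(4.0 / coeff)


if __name__ == "__main__":
    print(threshold())             # about 4.6
    print(lens_ratio(2.0, 5.0))    # above 1: dispersing near r = 2 * r1
    print(lens_ratio(2.0, 4.0))    # below 1: no dispersing effect
```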


* Cf. Harvey ( 1943 , p. 105, figure 43a). 




This is not the law required of a dispersing lens, even less of a lens which could be 
used to correct the aberrations of ordinary electron lenses. In a correcting lens the 
density is required to start at the axis with a value exceeding Hull’s value, and to 
increase approximately parabolically with the distance from it. 

It appears likely that a distribution similar to the required one would establish 
itself in any arrangement of the type as shown in figure 1, unless it is constructed with 
extraordinary precision. At some distance from the cathode, where the magnetic 
field has become appreciably homogeneous, the ‘axis’ of the magnetic field loses its 
significance, and the momentum integral is valid only if the electric field is rigorously 
rotationally symmetric and perfectly alined with the cathode. Very small departures 
from these ideal conditions will cause the charge distribution to depart appreciably 
from the calculated form. The hollow tube inside the radius $r_1$ will fill up with electrons, and one may expect that within certain limits the real distribution will approach the desired shape.


5. Extended electron source 

These processes may be followed to a certain point by a simple extension of the 
foregoing theory, if one considers cathodes of a certain radial extension instead of 
filament cathodes with vanishing thickness and perfect alinement. 

The density contributed by the emission of a cathode of vanishing thickness, which emits in the limits $u_0$, $u_0 + du_0$, was found to be

$$d\rho = \frac{a}{r}D(u_0)\,u_0^2\,du_0. \tag{18}$$

Now assume that $u_0$ and $du_0$ go to zero, whilst the product $D(u_0)\,u_0^2\,du_0$ divided by the radial extension $da$ of the cathode approaches a finite limit. It will be convenient to express $a$ by $r_0$ (as defined in figure 1) and include it in a function $F(r_0)$, with which the density due to the emission of an infinitesimal strip of the cathode is written

$$d\rho = \frac{1}{r}F(r_0)\,dr_0. \tag{31}$$

If the cathode has constant emission density, the function $F(r_0)$ is proportional to the cathode area corresponding to the limits $r_0$, $r_0 + dr_0$, multiplied with the radius $a$.

Assume in the following, to simplify matters, that the cathode extends to $r_0 = 0$, i.e. to the flux line $\Psi = 0$. Investigating only a region near the axis in which only a part of the cathode is active, the area outside a maximum radius $r_{0m}$ is cut off from contributing to the density by the momentum integral. This gives the following law for the potential and density distribution:

$$-4\pi\rho = \frac{d^2\phi}{dr^2} + \frac{1}{r}\frac{d\phi}{dr} = \frac{C}{r}\int_0^{r_{0m}}F(r_0)\,dr_0. \tag{32}$$





The radius $r_{0m}$ is determined by $\phi = \mathcal{A}^2$, and in a homogeneous magnetic field (equation (27)) this gives

$$r_{0m}^2 = r^2 + \frac{r}{H}\sqrt{\frac{8mc^2\phi}{e}}. \tag{33}$$

As a particularly simple example assume $F(r_0)$ = const. This gives a density proportional to

$$\sqrt{1 + \frac{1}{rH}\sqrt{\frac{8mc^2\phi}{e}}}.$$

Putting $\phi \propto r^2$ this becomes a constant, and the equation (32) is satisfied. Hence, in order to realize a dispersing lens in which at least near the axis the radial force is exactly proportional to the radius, the cathode must be arranged in such a way as to make $F(r_0)$ = const. This problem can easily be solved once the magnetic field is given. How far such a lens would in fact possess the desired qualities, and how far electron interaction would interfere with its performance, only experiment can show.
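That a parabolic potential makes the density uniform for $F(r_0)$ = const. can be verified numerically. The sketch assumes equation (33) in the form $r_{0m}^2 = r^2 + (r/H)\sqrt{8mc^2\phi/e}$ and represents the constant $8mc^2/e$ by a free parameter `g`; both function names are hypothetical.

```python
import math


def r0_max(r: float, phi: float, H: float, g: float = 1.0) -> float:
    """Largest contributing cathode radius r_0m at radius r (assumed form).

    g stands for the constant 8 m c^2 / e, kept as a parameter so that the
    check is independent of units.
    """
    return math.sqrt(r * r + (r / H) * math.sqrt(g * phi))


def relative_density(r: float, phi: float, H: float, g: float = 1.0) -> float:
    """For F = const the density is (1/r) * integral_0^{r_0m} dr0 = r_0m / r."""
    return r0_max(r, phi, H, g) / r


if __name__ == "__main__":
    b, H = 2.0, 3.0                        # phi = b * r^2, arbitrary field H
    for r in (0.1, 1.0, 5.0):
        print(r, relative_density(r, b * r * r, H))
```

With $\phi = br^2$ the printed value is the same at every radius, namely $\sqrt{1 + \sqrt{gb}/H}$.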


6. Magnetic effect of the rotating space charge 

Up to now, in the examples though not in the general formulae, a homogeneous 
magnetic field has been assumed. One must now check under what conditions this 
assumption is justified. 

The rotating electron swarm represents a ring current of intensity $i_t = \rho\bar v_t$. By a well-known proposition regarding the centre of gravity of a spherical shell segment

$$\bar v_t = \tfrac12 v(\cos\alpha_1 + \cos\alpha_2),$$

and introducing again $u$ instead of $v$, from equation (17.1) the simple result

$$\bar u_t = \mathcal{A}, \qquad \bar v_t = \frac{e}{mc}\left(A - \frac{a}{r}A_0\right) \tag{34}$$

is obtained.

Outside the radius $r_0$ the cloud rotates in the direction of the vector potential $A$, inside it in the opposite direction. It may be noted that the maximum departures from the average are $\pm au_0/r$. Hence on approaching the axis one finds increasing tangential velocity differences, at the same time as increasing electron density, so that electron interaction ought to play a particularly prominent part in this region.

The ring current has such a sign that outside $r_0$ it opposes the current which produces the magnetic field, whereas inside $r_0$ it increases the field. For large ratios $r_2/r_0$ the shielding effect far predominates.

Consider the middle region, in which the magnetic field is parallel to the $z$-direction. The law of its distribution is

$$-\operatorname{curl}\mathbf{H} = \frac{dH_z}{dr} = \frac{4\pi}{c}\,i_t = \frac{4\pi}{c}\,\rho\bar v_t. \tag{35}$$




Using equations (3) and (34) $H_z$ and $\bar v_t$ are expressed by means of the vector potential $A$, and denoting differentiation with respect to $r$ by primes,

$$A'' + \frac{1}{r}A' - \frac{1}{r^2}A = \frac{1}{\Lambda^2}\left[A - \frac{a}{r}A_0\right]\frac{\rho}{\rho_H} \tag{36}$$

is obtained, where

$$\Lambda = \sqrt{2}\,\frac{mc^2}{eH}. \tag{37}$$

$\Lambda$ is a characteristic length, which is $1/2\pi\sqrt{2} = 0.1125$ of the vacuum wave-length associated with the frequency $\omega_H$. Numerically

$$\Lambda H = 2410\ \text{cm. gauss}. \tag{37.1}$$

The effect of the ring current will be appreciable only in large magnetrons in which the radius is of the order of $\Lambda$.
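The numerical value of the characteristic length can be reproduced from the electron rest energy. The sketch assumes $\Lambda = \sqrt2\,mc^2/(eH)$ in Gaussian units, so that $\Lambda H = \sqrt2\,mc^2/e$ is a potential expressed in statvolts (1 gauss cm = 1 statvolt); the constant names and function names are choices made for this example.

```python
import math

ELECTRON_REST_ENERGY_EV = 510998.95    # m c^2 expressed in electron-volts
STATVOLT_IN_VOLTS = 299.792458         # one statvolt expressed in volts


def lambda_times_h() -> float:
    """Lambda * H = sqrt(2) * m c^2 / e in gauss cm (assumed definition)."""
    return math.sqrt(2.0) * ELECTRON_REST_ENERGY_EV / STATVOLT_IN_VOLTS


def lambda_cm(h_gauss: float) -> float:
    """Characteristic length in centimetres for a field of h_gauss."""
    return lambda_times_h() / h_gauss


if __name__ == "__main__":
    print(lambda_times_h())                        # about 2410 gauss cm
    print(lambda_cm(5000.0))                       # a little less than 0.5 cm
    print(1.0 / (2.0 * math.pi * math.sqrt(2.0)))  # fraction of the wave-length
```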

Now write the charge density in the form

$$\rho = \rho_H\,\frac{R}{r}. \tag{38}$$

The characteristic radius $R$ will approach $\tfrac14 r_2$ for large ratios $r_2/r_0$. The equation (36) now becomes

$$A'' + \frac{1}{r}A' - \frac{1}{r^2}A = \frac{R}{\Lambda^2 r}\left[A - \frac{a}{r}A_0\right]. \tag{36.1}$$

This has the particular solution $A = aA_0/r$, which is of no importance, as it does not contribute to the magnetic field. The solution of the homogeneous equation is

$$A = Z_2\!\left(i\sqrt{4Rr/\Lambda^2}\right), \tag{39}$$

i.e. a cylindrical function of the second order with imaginary argument. The solution is therefore a sum, with constant coefficients, of the modified Bessel and Hankel functions $I_2(\xi)$ and $K_2(\xi)$, if the symbol $\xi$ is introduced for the dimensionless parameter $\sqrt{4Rr/\Lambda^2}$.

For the constants there are two boundary conditions. The first is that at the outer radius $r_2$ the field intensity $H_z$ must assume a prescribed value. Substituting equation (39) into equation (3) and using the well-known relation between cylindrical functions

$$Z_n' + \frac{n}{\xi}Z_n = Z_{n-1},$$

then

$$\xi_2[C_1 I_1(\xi_2) + C_2 K_1(\xi_2)] = 2r_2 H(r_2), \tag{40}$$

where $\xi_2$ is the value of $\xi$ corresponding to $r_2$.

The second boundary condition is not so obvious, as the field intensity at the inner radius $r_1$ is unknown. The condition is obtained by the following consideration. Though the solution does not extend to the axis, but only to the radius $r_1$, it ought to be possible to continue it, by adding ring currents inside, according to the same law, without modifying the solution outside $r_1$, as currents inside a radius have no effect on the field outside it. This process of continuation leads at the axis to a magnetic field of the same direction as the outer field, which goes to infinity like $1/r$. This gives the condition from which the coefficient $C_2$ of the modified Hankel function $K_2$ can be determined.

In the following, to simplify the discussion, only the case $A_0 = 0$ will be considered, i.e. the cathode arranged on the line $\Psi = 0$, in which case the electrons arrive at the axis with zero tangential velocity and no singularity arises. In this case the whole mass of electrons rotates in one direction, so as to oppose the outer field by its magnetic effect. The solution in this case is given by the modified Bessel function $I_2(\xi)$, and the magnetic intensity follows the law

$$\frac{H(r)}{H(r_2)} = \sqrt{\frac{r_2}{r}}\;\frac{I_1\!\left(\sqrt{4Rr/\Lambda^2}\right)}{I_1\!\left(\sqrt{4Rr_2/\Lambda^2}\right)}. \tag{41}$$

For small arguments $I_1(\xi)$ approaches the limit $\tfrac12\xi$. Hence the magnetic field intensity at the axis becomes, if $H$ is written for $H(r_2)$,

$$H_0 = H\,\frac{\sqrt{Rr_2/\Lambda^2}}{I_1\!\left(\sqrt{4Rr_2/\Lambda^2}\right)}, \tag{41.1}$$

and as in this case $R = \tfrac14 r_2$, this can be written

$$H_0 = \tfrac12 H\,(r_2/\Lambda)/I_1(r_2/\Lambda). \tag{41.2}$$

The following table, calculated from the data of Jahnke & Emde (Funktionentafeln), shows the shielding effect of the ring current for magnetrons of different radius measured in units of $\Lambda$:

    $r_2/\Lambda$    0.25    0.5     1       2       3       4
    $H_0/H$ (%)      99.3    97.0    88.5    62.9    38.9    20.5

    $r_2/\Lambda$    5       6       7       8       9       10
    $H_0/H$ (%)      10.2    4.78    2.24    1.00    0.437   0.187


The effect becomes therefore very strong in large magnetrons, not too large to be practicable. With $H$ = 5000 gauss, for instance, $\Lambda$ is a little less than 0.5 cm.; hence with a radius of 5 cm. it ought to be possible to reduce the magnetic field at the axis to less than 100 gauss. According to the present theory, therefore, it ought to be possible to produce extraordinary concentrations of free electrons, far larger than ever produced experimentally, and to study them under favourable conditions in a relatively weak magnetic field. It may well be possible to produce concentrations sufficient for observable optical effects. But it must be borne in mind that the higher the electron density the stronger the interaction, and the more remote the present theory must be from reality.
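The shielding figures can be recomputed without tables from the power series of the modified Bessel function. The sketch below assumes the axis-field law in the form $H_0/H = \tfrac12 z/I_1(z)$ with $z = r_2/\Lambda$; the series routine and both function names are written for this example and are not taken from the paper.

```python
def bessel_i1(x: float, terms: int = 60) -> float:
    """Modified Bessel function I_1(x) from its power series
    sum_k (x/2)^(2k+1) / (k! (k+1)!)."""
    half = x / 2.0
    term = half                      # k = 0 term
    total = term
    for k in range(1, terms):
        term *= half * half / (k * (k + 1))
        total += term
    return total


def axis_field_percent(r2_over_lambda: float) -> float:
    """Field on the axis as a percentage of the outer field (assumed law)."""
    z = r2_over_lambda
    return 100.0 * 0.5 * z / bessel_i1(z)


if __name__ == "__main__":
    for z in (0.25, 0.5, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10):
        print(z, round(axis_field_percent(z), 3))
```

The series values agree closely with most entries of the table; small differences in a few entries are to be expected from the tabulated data used for the original calculation.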


I am indebted to Dr 0. J. Milner for many discussions on this subject. I thank the Directors of the British Thomson Houston Company for permission to publish this paper.




References 

Becker, R. 1933 Theorie der Elektrizität. 2. Elektronentheorie. Leipzig: Teubner.
Brillouin, L. 1941 Phys. Rev. 60, 385.
Brillouin, L. 1942 Phys. Rev. 62, 166.
Harvey, A. F. 1943 High frequency thermionic tubes. London: Chapman and Hall.
Hull, A. W. 1921 Phys. Rev. 18, 13.
Hull, A. W. 1924 Phys. Rev. 23, 112.
Langmuir, I. 1927 Z. Phys. 46, 271.
Langmuir, I. 1928 Proc. Nat. Acad. Sci., Wash., 14, 627.
Linder, E. G. 1938a Proc. Inst. Radio Engrs, N.Y., 26, 346.
Linder, E. G. 1938b J. Appl. Phys. 9, 331.
Pidduck, F. B. 1936 Quart. J. Math. 7, 201.
Schwarzschild, K. 1903 Nachr. Ges. Wiss. Göttingen, 3, 126.
Tolman, R. C. 1938 The principles of statistical mechanics. Oxford Univ. Press.


The electrical conductivity of an ionized gas in a magnetic 
field, with applications to the solar atmosphere 
and the ionosphere 

By T. G. Cowling, The University, Manchester 
(Communicated by S. Chapman, F.R.S. Received 26 July 1944)

The methods of Chapman and Enskog are used to discuss conduction of electricity and 
diffusion currents in an ionized gas with several constituents, in a transverse magnetic field. 

The free-path formula for the conductivity is compared with that derived by the exact 
methods. The two formulae are identical in form if a correction is applied to the usual free- 
path method; this correction robs the method of much of its simplicity. The uncorrected free- 
path method, however, gives correct results for the electron contribution to the conductivity 
in all practical cases; and for the ion contribution if a large number of neutral molecules are 
present—e.g. in the earth’s upper atmosphere, about 5 x 10 6 times the number of ions (of 
both signs). 

Numerical values are given for the conductivity in the sun’s outer layers and in the earth’s 
upper atmosphere. Mechanical forces due to currents induced in moving material are shown 
to be very important in the sun, and in the F-layer of the earth’s atmosphere. The solar 
results are used to discuss the motion of solar prominences and eruptions. In the earth’s 
atmosphere, the observed collision frequencies of electrons are shown to imply upper limits 
for ion-densities in the E and F layers. The integral conductivities of the E and F layers are estimated, and it is shown that, on the dynamo theory of the lunar variation of the earth’s magnetic field, tidal oscillations in these layers must be between 100 and 1000 times as great as those at the ground. Diamagnetism and drift currents are shown to make negligible contributions to the lunar and solar variations of the earth’s magnetic field.

In an Appendix, the applicability of Boltzmann’s equation to strongly ionized gases is 
discussed. 

1. Introduction 

In discussing diffusion, and the conduction of electricity, in an ionized gas in a transverse magnetic field, some authors have used the free-path method, and others the more exact ‘velocity-distribution’ method, originally developed by Chapman and Enskog. There are certain discrepancies between the results of the two methods; because of the relative simplicity of the free-path method, it has often been preferred to the other.

In the present paper, the conductivity is further discussed. First, the discrepancies 
between results of the two methods are shown to be due to the assumptions implicit 
in them, and to an internal inconsistency in the free-path method, as often used. 
Thereafter, the velocity-distribution method alone is used. Examination is made of 
the error involved in using first approximations to the velocities of diffusion in a 
binary gas. Similar first approximations are then used to discuss the conductivity 
of a gas with more than two constituents. The results are found, in certain important 
special cases, to resemble those derived by free-path methods. Electric currents 
arising from gradients of composition and pressure are also discussed. Applications 
to the atmospheres of the sun and the earth are briefly considered. 


2. Earlier results 

Suppose that a number of types of particle (neutral molecules, atoms, ions and 
electrons) are present; these are regarded as molecules of distinct gases. The mass, 
charge (in e.m.u.) and velocity of a molecule of the $r$th gas are denoted by $m_r$, $e_r$, $\mathbf{c}_r$; 
the number-density, mass-density, partial pressure and velocity-distribution function 
for the $r$th gas are $n_r$, $\rho_r$, $p_r$ and $f_r$; the number-density, mass-density, pressure 
and temperature of the gas as a whole are $n$, $\rho$, $p$ and $T$. The mass-velocity $\mathbf{c}_0$ of the 
gas is defined by 

$$\rho\,\mathbf{c}_0 = \sum_r \rho_r\bar{\mathbf{c}}_r, \tag{1}$$

where $\bar{\mathbf{c}}_r$ is the mean velocity of molecules of the $r$th gas; and the peculiar velocity 
$\mathbf{C}_r$ of a molecule is defined by 

$$\mathbf{C}_r = \mathbf{c}_r - \mathbf{c}_0.$$

The gas as a whole is supposed to be electrically neutral; thus 

$$\sum_r n_r e_r = 0. \tag{2}$$

The electric current-density $\mathbf{j}$ can be expressed in the form 

$$\mathbf{j} = \sum_r n_r e_r\bar{\mathbf{c}}_r = \sum_r n_r e_r(\mathbf{c}_0 + \bar{\mathbf{C}}_r) = \sum_r n_r e_r\bar{\mathbf{C}}_r. \tag{3}$$

It therefore does not depend on the mass-velocity, but only on the velocities of 
diffusion of the different gases through the mass. 

The force on a molecule is divided into two parts. One part, $e_r\mathbf{C}_r\wedge\mathbf{H}$, arises from 
the peculiar velocity $\mathbf{C}_r$ of the molecule in the magnetic field $\mathbf{H}$. The other part, 
$m_r\mathbf{F}_r$, is independent of the peculiar velocity; it may include a term $e_r\mathbf{c}_0\wedge\mathbf{H}$ due to 
the mass motion of the gas. Since diffusion parallel to a magnetic field is known to be 




unaffected by that field, we consider only the case in which $\mathbf{F}_r$ and all other factors 
producing diffusion are perpendicular to $\mathbf{H}$. Thermal diffusion is not considered; 
that is, temperature gradients are supposed absent. 

A simple form of the free-path theory introduces the ‘spiral-frequency’ 
$\omega_r = e_rH/m_r$, and the mean time $\tau_r$ between successive collisions of a molecule of 
the $r$th gas with molecules of other gases. The collision-interval $\tau_r$ is supposed not to 
depend on $\mathbf{c}_r$. It is therefore to be regarded as a mean collision-interval effective in 
diffusion; a comparison with velocity-distribution theory shows that, for rigid 
elastic spherical molecules, it must be taken as three-quarters of the collision-interval 
defined by the ratio of the mean free path to the mean peculiar speed. The 
free-path theory expresses the velocity of diffusion of the $r$th gas by 

$$\bar{\mathbf{C}}_r = \tau_r\left\{\left(\mathbf{F}_r - \frac{1}{\rho_r}\frac{\partial p_r}{\partial\mathbf{r}}\right) - \frac{\omega_r\tau_r}{H}\,\mathbf{H}\wedge\left(\mathbf{F}_r - \frac{1}{\rho_r}\frac{\partial p_r}{\partial\mathbf{r}}\right)\right\}\Big/\left(1+\omega_r^2\tau_r^2\right) \tag{4}$$

(Chapman & Cowling 1939, p. 327). This implies that the diffusion of the $r$th gas 
is due to 

$$\mathbf{F}_r - \frac{1}{\rho_r}\frac{\partial p_r}{\partial\mathbf{r}}, \tag{5}$$

i.e. to factors depending solely on the rth gas. It has two components, one parallel 
to the vector (5), and one perpendicular to both this and H; these are the 'direct’ 
and 'transverse’ velocities of diffusion. The electric current-density, given by (3), 
is the sum of contributions arising independently from the different gases. 
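
The free-path expression (4) can be illustrated with a short Python sketch. The function name `free_path_velocity` and its argument layout are assumptions made for this illustration only; `G` stands for the driving vector (5) and is taken to be perpendicular to the field, as in the text.

```python
import math


def cross(a, b):
    """Vector product a ^ b of two 3-component tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def free_path_velocity(G, H_vec, omega, tau):
    """Free-path diffusion velocity of one gas, in the form of equation (4).

    G     : driving vector (5), F_r - (1/rho_r) dp_r/dr, assumed perpendicular to H
    H_vec : magnetic field vector (only its direction is used)
    omega : spiral-frequency e_r H / m_r (sign follows the charge)
    tau   : collision-interval tau_r
    """
    H = math.sqrt(sum(h * h for h in H_vec))
    h_hat = tuple(h / H for h in H_vec)
    a = omega * tau
    hxG = cross(h_hat, G)
    # 'direct' part along G, 'transverse' part along H ^ G
    return tuple(tau * (G[i] - a * hxG[i]) / (1.0 + a * a) for i in range(3))


v = free_path_velocity((1.0, 0.0, 0.0), (0.0, 0.0, 1.0), omega=2.0, tau=0.5)
```

With these illustrative numbers $\omega_r\tau_r = 1$, so the direct and transverse components have equal magnitude.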

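The discussion by the velocity-distribution method has hitherto been limited to 
a binary mixture. It gives, as a first approximation to the velocity of relative diffusion 
of the constituent gases, 

$$\bar{\mathbf{C}}_1 - \bar{\mathbf{C}}_2 = -\frac{p\rho}{\rho_1\rho_2}\,\frac{\tau\,\mathbf{d}_{12} - H^{-1}\omega\tau^2\,\mathbf{H}\wedge\mathbf{d}_{12}}{1+\omega^2\tau^2} \tag{6}$$

(Chapman & Cowling 1939, p. 335), where 

$$\omega = \frac{H(e_1m_2\rho_2 + e_2m_1\rho_1)}{\rho\,m_1m_2} = \frac{\omega_1\rho_2 + \omega_2\rho_1}{\rho}, \tag{7}$$

$$\tau = \frac{\tau_1\rho_2 + \tau_2\rho_1}{\rho}, \tag{8}$$

and 

$$\mathbf{d}_{12} = \frac{\partial}{\partial\mathbf{r}}\left(\frac{n_1}{n}\right) + \frac{n_1n_2(m_2-m_1)}{np\rho}\frac{\partial p}{\partial\mathbf{r}} - \frac{\rho_1\rho_2}{p\rho}(\mathbf{F}_1 - \mathbf{F}_2). \tag{9}$$

Since, from (1), 

$$\sum_r \rho_r\bar{\mathbf{C}}_r = 0, \tag{10}$$

equation (6) determines the velocities of diffusion $\bar{\mathbf{C}}_1$, $\bar{\mathbf{C}}_2$ of the two constituent 
gases. 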



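Equations (4) and (6) have a superficial similarity, but this hides important 
differences. Equation (4), for a binary gas, makes the velocities of diffusion $\bar{\mathbf{C}}_1$, $\bar{\mathbf{C}}_2$ 
depend separately on the vectors 

$$\mathbf{F}_1 - \frac{1}{\rho_1}\frac{\partial p_1}{\partial\mathbf{r}}, \qquad \mathbf{F}_2 - \frac{1}{\rho_2}\frac{\partial p_2}{\partial\mathbf{r}};$$

equation (6) implies that each depends on the difference between the vectors. Again, 
(4) makes $\bar{\mathbf{C}}_1$, $\bar{\mathbf{C}}_2$ depend respectively on $\omega_1$, $\tau_1$ and $\omega_2$, $\tau_2$; (6) shows that each depends 
on $\omega$, $\tau$, which are weighted means of $\omega_1$, $\omega_2$ and $\tau_1$, $\tau_2$. 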

For example, if $n_1 = n_2$, $m_1 = m_2$, and $e_1 = -e_2$, (7) gives $\omega = 0$; thus (6) indicates 
no transverse diffusion, and a direct diffusion unaffected by the magnetic field. In 
this case, in fact, particles of the two gases have accelerations equal in magnitude 
and direction by reason of their equal and opposite velocities of diffusion in the 
magnetic field. Thus the field produces an acceleration of the gas as a whole; the 
diffusion of one constituent relative to the other is affected by the field only because 
the change in mass-velocity reacts on the forces producing diffusion. In indicating 
a transverse diffusion and a reduction in the direct diffusion, free-path theory 
confuses mass-motion effects with diffusion. 
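
The weighted means (7) and (8) are easy to evaluate numerically. The following Python sketch (the function name `mixture_means` and the sample values are assumptions chosen for illustration) reproduces the special case just described, in which $\omega$ vanishes.

```python
def mixture_means(H, e1, m1, n1, tau1, e2, m2, n2, tau2):
    """Weighted means omega and tau of equations (7) and (8) for a binary gas."""
    rho1, rho2 = n1 * m1, n2 * m2
    rho = rho1 + rho2
    omega1, omega2 = e1 * H / m1, e2 * H / m2      # spiral-frequencies
    omega = (omega1 * rho2 + omega2 * rho1) / rho  # equation (7)
    tau = (tau1 * rho2 + tau2 * rho1) / rho        # equation (8)
    return omega, tau


# Equal number-densities and masses, opposite charges: omega should vanish.
omega_sym, tau_sym = mixture_means(H=1.0,
                                   e1=1.0, m1=2.0, n1=3.0, tau1=0.4,
                                   e2=-1.0, m2=2.0, n2=3.0, tau2=0.4)
```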

Free-path theory in any case tacitly assumes that the gas is not accelerating. It 
calculates the mean velocity of molecules at a given point by grouping them 
according to their velocities at the beginning of the current free path, and in doing so 
it makes no distinction between the values of the mass-velocity at the beginning of 
the free path and at the instant considered. This tacit assumption must be borne in 
mind in comparisons with the velocity-distribution theory, which allows for an 
acceleration, though assuming that it produces only small changes in mass-velocity 
during a free path. But the results of the two theories are not identical even for an 
unaccelerated gas. 


3. Free-path theory 

The discrepancy between the results of the two theories arises because, in the 
free-path discussion leading to (4), it is assumed that, directly after a collision, a molecule 
has, on an average, no peculiar velocity, i.e. that its velocity is, on an average, $\mathbf{c}_0$. 
This assumption cannot, of course, apply to collisions between like molecules, as 
such collisions cannot destroy the common velocity of diffusion; collisions between 
like molecules are neglected in considering diffusion. But the assumption is 
also invalid for collisions between unlike molecules; if the velocities of molecules $m_1$, 
$m_2$ just after a mutual collision are distributed at random about a mean value $\mathbf{c}'$, 
then (remembering that the collision-interval is supposed independent of the 
molecular velocity) $\mathbf{c}'$ must satisfy the momentum equation 

$$(m_1 + m_2)\,\mathbf{c}' = m_1\bar{\mathbf{c}}_1 + m_2\bar{\mathbf{c}}_2 \tag{11}$$

and so cannot in general equal $\mathbf{c}_0$. 




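Consider an unaccelerated binary gas. It may, without loss of generality, be 
supposed to be at rest, since a change in $\mathbf{c}_0$ is simply equivalent to a change in $\mathbf{F}_r$. The 
free-path argument yielding (4) assumed the mean velocity of molecules $m_1$, $m_2$ 
immediately after collision to be zero; assume now that it is $\mathbf{c}'$, given by (11). Then 
the velocity of diffusion $\bar{\mathbf{C}}_r$ is increased by the velocity after a mean time $\tau_r$ of a 
molecule moving in the field $\mathbf{H}$ with initial velocity $\mathbf{c}'$, i.e. by 

$$(\mathbf{c}' - H^{-1}\omega_r\tau_r\,\mathbf{H}\wedge\mathbf{c}')/(1+\omega_r^2\tau_r^2).$$

Hence* 

$$\bar{\mathbf{c}}_r = (1+\omega_r^2\tau_r^2)^{-1}\left\{\mathbf{c}' + \mathbf{F}_r\tau_r - \frac{\tau_r}{\rho_r}\frac{\partial p_r}{\partial\mathbf{r}} - \frac{\omega_r\tau_r}{H}\,\mathbf{H}\wedge\left(\mathbf{c}' + \mathbf{F}_r\tau_r - \frac{\tau_r}{\rho_r}\frac{\partial p_r}{\partial\mathbf{r}}\right)\right\}, \tag{12}$$

and here $\bar{\mathbf{C}}_r = \bar{\mathbf{c}}_r$. Since $\mathbf{c}'$ is comparable with the diffusion velocity, the new terms 
in this equation are not unimportant. 

Equation (12) gives 

$$\bar{\mathbf{c}}_r - \frac{\omega_r\tau_r}{H}\,\bar{\mathbf{c}}_r\wedge\mathbf{H} = \mathbf{c}' + \mathbf{F}_r\tau_r - \frac{\tau_r}{\rho_r}\frac{\partial p_r}{\partial\mathbf{r}}.$$

Now $n_1/\tau_1 = n_2/\tau_2$, since each gives the frequency of collisions between unlike 
molecules per unit volume. Hence, using (8), 

$$n_r/\tau_r = n_1n_2(m_1+m_2)/\rho\tau,$$

and so 

$$\frac{n_1n_2(m_1+m_2)}{n_r\rho\tau}(\bar{\mathbf{c}}_r - \mathbf{c}') - \frac{\omega_r}{H}\,\bar{\mathbf{c}}_r\wedge\mathbf{H} = \mathbf{F}_r - \frac{1}{\rho_r}\frac{\partial p_r}{\partial\mathbf{r}}. \tag{13}$$

Multiply this equation by $\rho_r$ and sum over values of $r$; then since $\rho_r\omega_r = n_re_rH$, using 
(3) and (11), 

$$\sum_r \rho_r\mathbf{F}_r + \mathbf{j}\wedge\mathbf{H} - \frac{\partial p}{\partial\mathbf{r}} = 0.$$

This is the condition for zero acceleration of the gas, which, accordingly, follows as 
a direct consequence from (11). 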

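Again, by (9) and (13), 

$$\frac{p\rho}{\rho_1\rho_2}\,\mathbf{d}_{12} = \frac{m_1+m_2}{\rho\tau}\{n_1(\bar{\mathbf{c}}_2 - \mathbf{c}') - n_2(\bar{\mathbf{c}}_1 - \mathbf{c}')\} - \frac{1}{H}(\omega_2\bar{\mathbf{c}}_2 - \omega_1\bar{\mathbf{c}}_1)\wedge\mathbf{H}.$$

But, using (11) and (1), with $\mathbf{c}_0 = 0$, 

$$\rho_1\bar{\mathbf{c}}_1 = -\rho_2\bar{\mathbf{c}}_2 = \frac{n_1n_2(m_1+m_2)}{n_2-n_1}\,\mathbf{c}' = \frac{\rho_1\rho_2}{\rho}(\bar{\mathbf{c}}_1 - \bar{\mathbf{c}}_2).$$

Substituting these values for $\bar{\mathbf{c}}_1$, $\bar{\mathbf{c}}_2$, $\mathbf{c}'$ in terms of $\bar{\mathbf{c}}_1 - \bar{\mathbf{c}}_2$, 

$$\frac{p\rho}{\rho_1\rho_2}\,\mathbf{d}_{12} = -\frac{1}{\tau}(\bar{\mathbf{c}}_1 - \bar{\mathbf{c}}_2) + \frac{1}{\rho H}(\omega_1\rho_2 + \omega_2\rho_1)(\bar{\mathbf{c}}_1 - \bar{\mathbf{c}}_2)\wedge\mathbf{H} = -\frac{1}{\tau}(\bar{\mathbf{c}}_1 - \bar{\mathbf{c}}_2) + \frac{\omega}{H}(\bar{\mathbf{c}}_1 - \bar{\mathbf{c}}_2)\wedge\mathbf{H}$$

by (7). Thus the velocity of diffusion is given by 

$$\bar{\mathbf{C}}_1 - \bar{\mathbf{C}}_2 = -\frac{p\rho\tau}{\rho_1\rho_2}\,\frac{\mathbf{d}_{12} - H^{-1}\omega\tau\,\mathbf{H}\wedge\mathbf{d}_{12}}{1+\omega^2\tau^2}.$$

* This result can also be derived directly, using methods similar to those of Chapman & 
Cowling (1939, pp. 325-8). 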

This equation is identical with (6). That is, the discrepancy between the free-path 
and velocity-distribution results is removed when the mean peculiar velocity of 
molecules just after collision is assumed to be, not zero, but $\mathbf{c}'$, given by (11). 

Because the diffusion velocity of molecules of one constituent gas of a mixture is, 
in part, transferred to molecules of other constituents by collisions, the diffusion 
velocity of any constituent does not depend wholly on factors affecting only that 
constituent. This deprives the free-path method of its chief simplicity. The free-path 
discussion cannot, moreover, itself determine the collision-interval $\tau$. We therefore 
use only the velocity-distribution method in later work. 
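
The algebra of this section can be checked numerically. The Python sketch below (function names and sample values are assumptions for illustration) evaluates the relative diffusion velocity in the form of (6), with $\mathbf{d}_{12}$ perpendicular to $\mathbf{H}$, and confirms that it satisfies the relation $(p\rho\tau/\rho_1\rho_2)\,\mathbf{d}_{12} = -\mathbf{u} + (\omega\tau/H)\,\mathbf{u}\wedge\mathbf{H}$ obtained above, where $\mathbf{u}$ denotes $\bar{\mathbf{c}}_1 - \bar{\mathbf{c}}_2$.

```python
def cross(a, b):
    """Vector product a ^ b of two 3-component tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def relative_diffusion_velocity(d12, h_hat, omega, tau, p, rho1, rho2):
    """Relative diffusion velocity in the form of equation (6).

    d12   : vector d_12 of equation (9), assumed perpendicular to the field
    h_hat : unit vector along H
    """
    rho = rho1 + rho2
    a = omega * tau
    k = -p * rho * tau / (rho1 * rho2)
    hxd = cross(h_hat, d12)
    return tuple(k * (d12[i] - a * hxd[i]) / (1.0 + a * a) for i in range(3))


# Illustrative values only.
d12, h_hat = (0.3, -0.2, 0.0), (0.0, 0.0, 1.0)
omega, tau, p, rho1, rho2 = 1.5, 0.7, 2.0, 1.0, 3.0
u = relative_diffusion_velocity(d12, h_hat, omega, tau, p, rho1, rho2)

# Residual of (p rho tau / rho1 rho2) d12 = -u + omega tau (u ^ h_hat).
lhs = tuple(p * (rho1 + rho2) * tau / (rho1 * rho2) * x for x in d12)
uxh = cross(u, h_hat)
residual = max(abs(lhs[i] - (-u[i] + omega * tau * uxh[i])) for i in range(3))
```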


4. Binary gas; second approximation 

Equation (6) gives only a first approximation to the velocity of diffusion in a 
binary gas. The first approximation, like other first approximations of the 
velocity-distribution theory, gives exact results for Maxwellian molecules (molecules 
repelling each other according to an inverse fifth power law); the free-path assumption 
of a collision-interval independent of the molecular velocity is essentially equivalent 
to assuming Maxwellian molecules. In the absence of a magnetic field the first 
approximation usually gives results correct to within 2 or 3 % for non-Maxwellian 
molecules; this, however, is not true when the force of interaction between 
molecules follows the inverse-square law. For this law, when $m_1/m_2$ is large, and 
interaction between pairs of molecules $m_2$ is neglected, the first approximation to the 
diffusion velocity is only about 0.3 of the exact value (Chapman & Cowling 1939, 
p. 196). Since in an ionized gas the forces between molecules follow the 
inverse-square law, I have considered a second approximation to the velocity of diffusion 
for this law. 

The method of calculation is indicated in Chapman & Cowling (1939, pp. 333-4); 
here only the results are quoted. The velocity of diffusion is given, to any order of 
approximation, by an equation of the form 

$$\bar{\mathbf{C}}_1 - \bar{\mathbf{C}}_2 = -\frac{p\rho\tau}{\rho_1\rho_2}\,(\alpha\,\mathbf{d}_{12} + H^{-1}\beta\,\mathbf{H}\wedge\mathbf{d}_{12}). \tag{14}$$

The first approximation (6) corresponds to $\alpha = \alpha_0$, $\beta = \beta_0$, where 

$$\alpha_0 + i\beta_0 = 1/(1 + i\omega\tau). \tag{15}$$

If $\omega\tau$ is large (magnetic field large, density small), the second (and all later) 
approximations give values of $\alpha$, $\beta$ indistinguishable from $\alpha_0$, $\beta_0$. If $\omega\tau$ is small, the second 
approximation gives a value of $\alpha$ greater than $\alpha_0$, and a value of $\beta$ numerically 
greater than $\beta_0$ save when both of these two quantities are small. It gives $\alpha$ not very 
much greater than $\alpha_0$ if $m_1$, $m_2$ are not very different;* e.g. if $m_1/m_2$ = f, then taking 
in turn $n_1/n_2 = 2$, 1 and ½, it is found that $\alpha/\alpha_0 = 1.27$, 1.21 and 1.25, and $\beta/\beta_0 = 1.65$, 
2.31 and 0.41, the irregular variation of $\beta/\beta_0$ being due to the fact that $\beta$ and $\beta_0$ 
both vanish between $n_1/n_2 = 1$ and $n_1/n_2 = ½$. When $m_1/m_2$ is very large, $\alpha/\alpha_0$ may 
be considerably greater than unity, though even in this case it is not so great as when 
the mutual collisions of the light molecules are neglected. For example, if $n_1 = n_2$, 
we find $\alpha = 1.96\alpha_0$, $\beta = 4.88\beta_0$; if $n_1 = \tfrac{1}{10}n_2$ ($e_1 = -10e_2$, corresponding to a highly 
ionized gas), $\alpha = 3\alpha_0$, $\beta = 13.3\beta_0$. 

The theory of electrostatic interaction is, in any case, only approximate, and in 
most applications the data are very uncertain. It is therefore usually adequate to 
use the first approximation in considering electric currents due to transport of ions, 
in which the molecular masses are comparable; this is especially true when large 
numbers of neutral molecules are present, since interaction with these does not 
demand such large corrections to $\alpha_0$, $\beta_0$ as inverse-square law interaction. In 
considering the currents due to electrons, the error of the first approximation is too 
large to be ignored when $\omega\tau$ is small. The figures quoted above suggest that in this 
case, since $\beta_0 \propto \tau$, it is sufficient in (14) to use the first approximation with an 
increased value of $\tau$. 
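
As a small numerical aid, the first-approximation coefficients can be read off as the real and imaginary parts of a complex number. The sketch below assumes that (15) has the form $\alpha_0 + i\beta_0 = 1/(1 + i\omega\tau)$; the function name is hypothetical.

```python
def first_approximation_coefficients(omega, tau):
    """Return (alpha_0, beta_0), assuming alpha_0 + i beta_0 = 1 / (1 + i omega tau)."""
    z = 1.0 / complex(1.0, omega * tau)
    return z.real, z.imag


# With omega * tau = 1 the two coefficients are equal in magnitude.
alpha0, beta0 = first_approximation_coefficients(omega=2.0, tau=0.5)
```

For small $\omega\tau$ this gives $\beta_0 \approx -\omega\tau$, i.e. $\beta_0$ proportional to $\tau$, as used in the text.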


5. Multiple gas 

Now consider diffusion in a gas with several constituents. The mass-velocity $\mathbf{c}_0$ 
is supposed to be uniform, so as to remove the viscosity terms from the equations. 
If the velocity-distribution function $f_r$ is regarded as a function of $\mathbf{C}_r$, $\mathbf{r}$ and $t$, the 
Boltzmann equation which it satisfies is of the form 


Wr 

Dt 




(cf. Chapman & Cowling 1939, p. 323, equation (3)). The first approximation to the 
solution of this is the Maxwellian function 



The second approximation is ff>(l +f r ), where 


and 


IrMr + <Ps)^\\f?m<f>r + fs-fr-<P' a )KsdkdC s . (17) 


* The constant A appearing in the general expressions (Chapman & Cowling 1939, p. 164) 
is less, but not much less, than 0.4 for inverse-square law interaction; its value is here taken 
as 0.375. 





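In evaluating the first and third terms on the left of (16), the following first 
approximations are used to the equation of continuity for the $r$th gas, and to the 
equations of energy and momentum: 

$$\frac{D_0n_r}{Dt} = 0, \qquad \frac{D_0T}{Dt} = 0, \qquad \rho\frac{D_0\mathbf{c}_0}{Dt} = \sum_r\rho_r\mathbf{F}_r + \mathbf{j}\wedge\mathbf{H} - \frac{\partial p}{\partial\mathbf{r}}. \tag{18}$$

In the last equation, $\mathbf{j}$ is taken to be the current given by the second approximation 
to $f_r$. The first three terms on the left of (16) accordingly reduce to 

$$f_r^{(0)}\,\mathbf{C}_r\cdot\left\{\frac{n}{n_r}\,\mathbf{d}_r + \frac{m_r}{\rho kT}\,\mathbf{j}\wedge\mathbf{H}\right\}.$$

In this, $\mathbf{d}_r$ denotes the vector 

$$\mathbf{d}_r = \frac{\rho_r}{p}\left(\frac{D_0\mathbf{c}_0}{Dt} - \mathbf{F}_r - \frac{1}{\rho}\,\mathbf{j}\wedge\mathbf{H} + \frac{1}{\rho_r}\frac{\partial p_r}{\partial\mathbf{r}}\right) \tag{19}$$

$$\phantom{\mathbf{d}_r} = \frac{\partial}{\partial\mathbf{r}}\left(\frac{n_r}{n}\right) - \frac{n\rho_r - n_r\rho}{np\rho}\frac{\partial p}{\partial\mathbf{r}} - \frac{1}{p\rho}\sum_s\rho_r\rho_s(\mathbf{F}_r - \mathbf{F}_s), \tag{19'}$$

so that 

$$\sum_r\mathbf{d}_r = 0. \tag{20}$$

In general the force $m_r\mathbf{F}_r$ on a molecule is the sum of a gravitational force $m_r\mathbf{g}$ 
and an electric force $e_r\mathbf{E}$ (where $\mathbf{E}$ may include a part $\mathbf{c}_0\wedge\mathbf{H}$ due to mass motion in 
the magnetic field). Thus (19') reduces to 

$$\mathbf{d}_r = \frac{1}{p}\left(\frac{\partial p_r}{\partial\mathbf{r}} - \frac{\rho_r}{\rho}\frac{\partial p}{\partial\mathbf{r}} - n_re_r\mathbf{E}\right). \tag{21}$$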

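Equation (16) now takes the form 

$$\frac{n}{n_r}f_r^{(0)}\,\mathbf{C}_r\cdot\mathbf{d}_r = -f_r^{(0)}\,\mathbf{C}_r\cdot\left\{\frac{m_r}{\rho kT}\,\mathbf{j}\wedge\mathbf{H} + \frac{e_r}{m_r}\,\mathbf{H}\wedge\frac{\partial\phi_r}{\partial\mathbf{C}_r}\right\} - \sum_sI_{rs}(\phi_r + \phi_s). \tag{22}$$

The solution of this is of the form 

$$\phi_r = -n\sum_s\mathbf{d}_s\cdot\{D^{\mathrm{I}}_{rs}\mathbf{C}_r + D^{\mathrm{II}}_{rs}\mathbf{C}_r\wedge\mathbf{H}\}, \tag{23}$$

where $D^{\mathrm{I}}_{rs}$, $D^{\mathrm{II}}_{rs}$ are both functions of $C_r$ and $H$. The velocity of 
diffusion is given by 

$$n_r\bar{\mathbf{C}}_r = \int f_r^{(0)}\phi_r\mathbf{C}_r\,d\mathbf{c}_r = -\tfrac{1}{3}n\int f_r^{(0)}C_r^2\sum_s(D^{\mathrm{I}}_{rs}\mathbf{d}_s + D^{\mathrm{II}}_{rs}\mathbf{H}\wedge\mathbf{d}_s)\,d\mathbf{c}_r. \tag{24}$$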




Hence, in the absence of a magnetic field, each velocity of diffusion has components 
parallel to each vector $\mathbf{d}_s$. This implies that any one gas in the mixture may acquire 
a velocity of diffusion as a result of factors not directly affecting it, e.g. as a result of 
electric forces acting on other gases in the mixture, even when its own molecules 
are uncharged. As shown in § 3, this ‘induced’ diffusion is due to the transfer of the 
diffusion velocity of one gas to others at collisions. The introduction of a magnetic 
field alters the magnitude of the component of $\bar{\mathbf{C}}_r$ parallel to any $\mathbf{d}_s$, and also 
produces new components, perpendicular to $\mathbf{H}$ and to each $\mathbf{d}_s$. 

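Similar remarks apply to the current-density $\mathbf{j}$, given by 

$$\mathbf{j} = \sum_rn_re_r\bar{\mathbf{C}}_r = -n\sum_s(L^{\mathrm{I}}_s\mathbf{d}_s + L^{\mathrm{II}}_s\mathbf{H}\wedge\mathbf{d}_s), \tag{25}$$

where 

$$L^{\mathrm{I}}_s = \tfrac{1}{3}\sum_r\int f_r^{(0)}e_rC_r^2D^{\mathrm{I}}_{rs}\,d\mathbf{c}_r, \qquad L^{\mathrm{II}}_s = \tfrac{1}{3}\sum_r\int f_r^{(0)}e_rC_r^2D^{\mathrm{II}}_{rs}\,d\mathbf{c}_r. \tag{26}$$

If the current is due solely to an electric field $\mathbf{E}$, 

$$\mathbf{d}_r = -n_re_r\mathbf{E}/p$$

and so 

$$\mathbf{j} = \sigma^{\mathrm{I}}\mathbf{E} + H^{-1}\sigma^{\mathrm{II}}\,\mathbf{H}\wedge\mathbf{E}, \tag{27}$$

where 

$$\sigma^{\mathrm{I}} = \frac{n}{p}\sum_sn_se_sL^{\mathrm{I}}_s, \qquad \sigma^{\mathrm{II}} = \frac{nH}{p}\sum_sn_se_sL^{\mathrm{II}}_s. \tag{28}$$

Equation (27) implies that, in the presence of a transverse magnetic field, an electric 
field $\mathbf{E}$ produces currents of magnitude $\sigma^{\mathrm{I}}E$ and $\sigma^{\mathrm{II}}E$, respectively parallel to $\mathbf{E}$ 
and perpendicular to both $\mathbf{H}$ and $\mathbf{E}$. $\sigma^{\mathrm{I}}$ and $\sigma^{\mathrm{II}}$ may be called the direct and 
transverse conductivities. 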
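
Equation (27) can be illustrated by a short Python sketch that assembles the current from given direct and transverse conductivities. The function name and the numerical values are assumptions for illustration; $\mathbf{E}$ is taken perpendicular to $\mathbf{H}$, as throughout this section.

```python
import math


def cross(a, b):
    """Vector product a ^ b of two 3-component tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def current_density(sigma_direct, sigma_transverse, H_vec, E):
    """Current of equation (27): j = sigma_I E + (sigma_II / H) H ^ E."""
    H = math.sqrt(sum(h * h for h in H_vec))
    HxE = cross(H_vec, E)
    return tuple(sigma_direct * E[i] + sigma_transverse * HxE[i] / H
                 for i in range(3))


# Illustrative values: field along z, electric field along x.
j = current_density(3.0, -1.0, (0.0, 0.0, 2.0), (1.0, 0.0, 0.0))
```

The first component is the direct current, parallel to $\mathbf{E}$; the second is the transverse current, perpendicular to both $\mathbf{H}$ and $\mathbf{E}$.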


6. First approximation to diffusion velocities 
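Substituting for $\phi_r$ and $\mathbf{j}$ from (23) and (25) into (22), then 

$$\frac{1}{n_r}f_r^{(0)}\,\mathbf{C}_r\cdot\mathbf{d}_r = f_r^{(0)}\frac{m_r}{\rho kT}\,\mathbf{C}_r\cdot\sum_s(-L^{\mathrm{I}}_s\mathbf{H}\wedge\mathbf{d}_s + H^2L^{\mathrm{II}}_s\mathbf{d}_s) + f_r^{(0)}\frac{e_r}{m_r}\,\mathbf{C}_r\cdot\sum_s(D^{\mathrm{I}}_{rs}\mathbf{H}\wedge\mathbf{d}_s - H^2D^{\mathrm{II}}_{rs}\mathbf{d}_s)$$
$$\qquad + \sum_qI_{rq}\Big(\sum_s\mathbf{d}_s\cdot\{D^{\mathrm{I}}_{rs}\mathbf{C}_r + D^{\mathrm{II}}_{rs}\mathbf{C}_r\wedge\mathbf{H} + D^{\mathrm{I}}_{qs}\mathbf{C}_q + D^{\mathrm{II}}_{qs}\mathbf{C}_q\wedge\mathbf{H}\}\Big). \tag{29}$$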

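In this, the vectors $\mathbf{d}_s$ are not independent, because of (20). Equally, however, the 
functions $D^{\mathrm{I}}_{rs}$, $D^{\mathrm{II}}_{rs}$ are not uniquely defined so far; for any given $r$, the value of $\phi_r$ 
given by (23) is unaffected if each $D^{\mathrm{I}}_{rs}$ or $D^{\mathrm{II}}_{rs}$ is increased by the same amount. To 
make $D^{\mathrm{I}}_{rs}$ and $D^{\mathrm{II}}_{rs}$ definite, suppose that for a special value $s'$ of $s$ 

$$D^{\mathrm{I}}_{rs'} = 0 = D^{\mathrm{II}}_{rs'}.$$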





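Suppose also that (20) is used to express $\mathbf{d}_{s'}$ in terms of the other vectors $\mathbf{d}_s$. These 
other vectors are independent; thus one can afterwards equate coefficients of $\mathbf{d}_s$ 
and $\mathbf{H}\wedge\mathbf{d}_s$ on opposite sides of (29). This gives 

$$\frac{1}{n_r}f_r^{(0)}\mathbf{C}_r(\delta_{rs} - \delta_{rs'}) = H^2f_r^{(0)}\mathbf{C}_r\left(\frac{m_r}{\rho kT}L^{\mathrm{II}}_s - \frac{e_r}{m_r}D^{\mathrm{II}}_{rs}\right) + \sum_qI_{rq}(D^{\mathrm{I}}_{rs}\mathbf{C}_r + D^{\mathrm{I}}_{qs}\mathbf{C}_q),$$
$$0 = f_r^{(0)}\mathbf{C}_r\left(-\frac{m_r}{\rho kT}L^{\mathrm{I}}_s + \frac{e_r}{m_r}D^{\mathrm{I}}_{rs}\right) + \sum_qI_{rq}(D^{\mathrm{II}}_{rs}\mathbf{C}_r + D^{\mathrm{II}}_{qs}\mathbf{C}_q). \tag{30}$$

Now put 

$$\Lambda_{rs} = D^{\mathrm{I}}_{rs} + iHD^{\mathrm{II}}_{rs}, \qquad \Lambda_s = L^{\mathrm{I}}_s + iHL^{\mathrm{II}}_s. \tag{31}$$

Then equations (30) can be replaced by the complex equation 

$$\frac{1}{n_r}f_r^{(0)}\mathbf{C}_r(\delta_{rs} - \delta_{rs'}) = iHf_r^{(0)}\mathbf{C}_r\left(-\frac{m_r}{\rho kT}\Lambda_s + \frac{e_r}{m_r}\Lambda_{rs}\right) + \sum_qI_{rq}(\Lambda_{rs}\mathbf{C}_r + \Lambda_{qs}\mathbf{C}_q). \tag{32}$$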

In this, $q$ and $r$ can take all values; $s$ takes all values other than $s'$. 

Only a first approximation to the solution of (32) will be considered, similar to that 
yielding (6) for a binary gas. This is found by regarding $\Lambda_{rs}$ as a quantity independent 
of $\mathbf{C}_r$, to be determined, not from (32), but from the equations obtained by taking the 
scalar product of $m_r\mathbf{C}_r$ with (32), and integrating over all values of $\mathbf{c}_r$. In the notation 
of Chapman & Cowling (1939, p. 86, equation (9)), the resulting equations are 

$$3kT(\delta_{rs} - \delta_{rs'}) = 3iH\left(-\frac{\rho_r}{\rho}\Lambda_s + \frac{e_r}{m_r}n_rkT\Lambda_{rs}\right) + \sum_qn_qn_r[m_r\mathbf{C}_r,\ \Lambda_{rs}\mathbf{C}_r + \Lambda_{qs}\mathbf{C}_q]_{qr}. \tag{33}$$

Also from (24), taking $\Lambda_{rs}$ (and therefore $D^{\mathrm{I}}_{rs}$, $D^{\mathrm{II}}_{rs}$) as independent of $\mathbf{C}_r$, 

$$n_r\bar{\mathbf{C}}_r = -\frac{np_r}{m_r}\sum_s(D^{\mathrm{I}}_{rs}\mathbf{d}_s + D^{\mathrm{II}}_{rs}\mathbf{H}\wedge\mathbf{d}_s). \tag{34}$$

Remembering that $n\mathbf{d}_s = \partial n_s/\partial\mathbf{r}$ for an inhomogeneous gas at rest under no forces, 
it is seen that $p_rD^{\mathrm{I}}_{rs}/m_r$, $Hp_rD^{\mathrm{II}}_{rs}/m_r$ may be regarded as generalized diffusion 
coefficients for direct and transverse diffusion. 

The collision integral appearing as the last term on the right of (33) can be 
evaluated, using the formulae of Chapman & Cowling (1939, § 9-8). The result is 

$$[m_r\mathbf{C}_r,\ \Lambda_{rs}\mathbf{C}_r + \Lambda_{qs}\mathbf{C}_q]_{qr} = \frac{16kT}{m_q+m_r}\,\Omega^{(1)}_{qr}(1)\,(m_q\Lambda_{rs} - m_r\Lambda_{qs}) = \frac{3k^2T^2}{(n_q+n_r)D_{qr}}\left(\frac{\Lambda_{rs}}{m_r} - \frac{\Lambda_{qs}}{m_q}\right),$$

where $D_{qr}$ would be the ordinary coefficient of mutual diffusion of the $q$th and $r$th 
gases in the absence of the other gases. This result can be expressed in terms of an 
effective collision-interval $\tau_{qr}$ for molecules of the $q$th and $r$th gases (similar to the 
$\tau$ of (8)), such that 

$$\tau_{qr} = \frac{m_qm_r(n_q+n_r)D_{qr}}{(\rho_q+\rho_r)kT}. \tag{35}$$



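It is, however, more convenient to use $\theta_{qr}$, defined by 

$$\theta_{qr} = \frac{\rho_q\rho_r}{(\rho_q+\rho_r)\tau_{qr}} = \frac{n_qn_rkT}{(n_q+n_r)D_{qr}}. \tag{36}$$

Then 

$$[m_r\mathbf{C}_r,\ \Lambda_{rs}\mathbf{C}_r + \Lambda_{qs}\mathbf{C}_q]_{qr} = \frac{3kT}{n_qn_r}\,\theta_{qr}\left(\frac{\Lambda_{rs}}{m_r} - \frac{\Lambda_{qs}}{m_q}\right),$$

and equation (33) becomes 

$$\delta_{rs} - \delta_{rs'} = iH\left(-\frac{\rho_r}{\rho kT}\Lambda_s + \frac{e_rn_r}{m_r}\Lambda_{rs}\right) + \sum_q\theta_{qr}\left(\frac{\Lambda_{rs}}{m_r} - \frac{\Lambda_{qs}}{m_q}\right). \tag{37}$$

If this relation is summed over all values of $r$, it is found that 

$$\Lambda_s = kT\sum_re_rn_r\Lambda_{rs}/m_r. \tag{38}$$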


This equation could also be obtained directly from (26) and (31). 

In addition to (37), further equations are satisfied by the quantities $\Lambda_{rs}$, in virtue 
of the momentum equation (10). Substituting for $\bar{\mathbf{C}}_r$ from (34) into this equation, 

$$0 = \sum_r\rho_r\Lambda_{rs}/m_r. \tag{39}$$

Equations (37) and (39) suffice to determine the quantities $\Lambda_{rs}$ and $\Lambda_s$. The 
determinantal solution is of the form 

$$H\Lambda_s/(\rho kT) = i(\mathcal{L}_{N+1,s} - \mathcal{L}_{N+1,s'})/\mathcal{L}, \tag{40}$$

$$\Lambda_{rs}/m_r = (\mathcal{L}_{rs} - \mathcal{L}_{rs'})/\mathcal{L}. \tag{41}$$

Here $\mathcal{L}$ is a symmetric determinant with $N+1$ rows and columns ($N$ = number of 
gases), the elements $l_{rs}$ of which are given by 

$$l_{rs} = -\theta_{rs} + \delta_{rs}\Big(\sum_q\theta_{qr} + e_rn_riH\Big) \quad (r, s \le N),$$
$$l_{N+1,r} = l_{r,N+1} = \rho_r \quad (r \le N), \qquad l_{N+1,N+1} = 0, \tag{42}$$

and $\mathcal{L}_{rs}$ is the cofactor of $l_{rs}$ in $\mathcal{L}$. 

Equation (41) gives the formal solution to the problem of diffusion, since the 
generalized coefficients of diffusion $p_rD^{\mathrm{I}}_{rs}/m_r$, $Hp_rD^{\mathrm{II}}_{rs}/m_r$ are the real and imaginary 
parts of $p_r\Lambda_{rs}/m_r$. Similarly, (40) formally gives the electric currents (cf. (26), (31)). 
The direct and transverse conductivities are given by 

$$\sigma^{\mathrm{I}} + i\sigma^{\mathrm{II}} = (kT)^{-1}\sum_sn_se_s\Lambda_s = \frac{i\rho\,\mathcal{L}_e}{H\mathcal{L}}, \tag{43}$$

where $\mathcal{L}_e$ is got from $\mathcal{L}$ by replacing the terms $\rho_r$ of the last row by $n_re_r$. 





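7. Ternary mixture 

For a mixture of three gases, the determinant $\mathcal{L}$ becomes 

$$\mathcal{L} = \begin{vmatrix} e_1n_1iH + \theta_{12} + \theta_{13} & -\theta_{12} & -\theta_{13} & \rho_1 \\ -\theta_{12} & e_2n_2iH + \theta_{23} + \theta_{12} & -\theta_{23} & \rho_2 \\ -\theta_{13} & -\theta_{23} & e_3n_3iH + \theta_{13} + \theta_{23} & \rho_3 \\ \rho_1 & \rho_2 & \rho_3 & 0 \end{vmatrix}$$

$$= H^2\sum\rho_1^2e_2n_2e_3n_3 - iH\rho\sum\theta_{23}n_1e_1(\rho - 2\rho_1) - \rho^2(\theta_{12}\theta_{13} + \theta_{23}\theta_{12} + \theta_{13}\theta_{23}). \tag{44}$$

Similarly the cofactors of the terms of $\mathcal{L}$ can be found. The determinant $\mathcal{L}_e$ is 
given by 

$$\mathcal{L}_e = H^2\rho\,n_1n_2n_3e_1e_2e_3 + iH\rho\sum n_1^2e_1^2\theta_{23}. \tag{45}$$

The general expressions for the electric currents and the conductivity can be 
simplified in certain special cases, which will now be discussed in detail. 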

Case I. Slightly ionized gas. Suppose that the first gas supplies nearly all of the 
molecules of the mixture. Since the gas as a whole is supposed 
to be electrically neutral, this implies in practice that $e_1 = 0$, $n_2e_2 = -n_3e_3$. Suppose 
also that nearly all the collisions of molecules $m_2$, $m_3$ are with molecules 
$m_1$ (this has to be stated as a separate assumption, because the effective 
collision-radius for the collisions of pairs of charged molecules is very large). Analytically, 
this implies that $\theta_{23}$ is negligible compared with $\theta_{12}$ and $\theta_{13}$. Thus approximately 

$$\mathcal{L} = H^2\rho^2n_2e_2n_3e_3 - iH\rho^2(e_2n_2\theta_{13} + e_3n_3\theta_{12}) - \rho^2\theta_{12}\theta_{13},$$

$$\mathcal{L}_e = -iH\rho\,n_2e_2n_3e_3(\theta_{12} + \theta_{13}).$$

Hence 

$$\sigma^{\mathrm{I}} + i\sigma^{\mathrm{II}} = \frac{n_2^2e_2^2}{\theta_{12} + iHe_2n_2} + \frac{n_3^2e_3^2}{\theta_{13} + iHe_3n_3},$$

or, since (36) now gives $\theta_{12} = \rho_2/\tau_{12}$, $\theta_{13} = \rho_3/\tau_{13}$, 

$$\sigma^{\mathrm{I}} + i\sigma^{\mathrm{II}} = \frac{n_2e_2^2\tau_{12}}{m_2(1 + i\omega_2\tau_{12})} + \frac{n_3e_3^2\tau_{13}}{m_3(1 + i\omega_3\tau_{13})}. \tag{46}$$

Thus the conductivity can be divided into two parts, one arising from each of the 
charged gases; (46) is, in fact, identical with the result obtained from free-path 
theory, so that in this case the correct conductivity can be deduced from (4). This is 
to be expected: the transfer of diffusion-velocity from one gas to another at 
collisions is unimportant here, because the neutral gas provides most of the mass, 
and has a negligible velocity of diffusion. 




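When diffusion is due to factors other than simply an electric field, the electric 
current is given by 

$$\mathbf{j} = -\frac{e_2p\tau_{12}}{m_2}\,\frac{\mathbf{d}_2 - \omega_2\tau_{12}H^{-1}\mathbf{H}\wedge\mathbf{d}_2}{1 + \omega_2^2\tau_{12}^2} - \frac{e_3p\tau_{13}}{m_3}\,\frac{\mathbf{d}_3 - \omega_3\tau_{13}H^{-1}\mathbf{H}\wedge\mathbf{d}_3}{1 + \omega_3^2\tau_{13}^2},$$

where $\mathbf{d}_2$, $\mathbf{d}_3$ are given by (21). This equation also is very similar to free-path results. 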
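
The reduction of the general determinantal result to the two-term form (46) can be checked numerically. The Python sketch below builds the bordered determinant with the elements (42), evaluates the conductivity as $i\rho\mathcal{L}_e/(H\mathcal{L})$ as in (43), and compares it with (46) for a slightly ionized mixture. All names and numerical values are assumptions chosen for illustration (arbitrary units, gas 1 neutral and abundant, $\theta_{23}$ set to zero); the determinant routine is a plain Laplace expansion, adequate only for small matrices.

```python
def det(a):
    """Determinant by Laplace expansion along the first row."""
    if len(a) == 1:
        return a[0][0]
    total = 0
    for j, entry in enumerate(a[0]):
        minor = [row[:j] + row[j + 1:] for row in a[1:]]
        total += (-1) ** j * entry * det(minor)
    return total


def conductivity(H, n, m, e, theta):
    """sigma_I + i sigma_II from the determinants of (42) and (43).

    theta[q][r] holds the symmetric collision functions theta_qr of (36);
    the diagonal entries are ignored. H must be non-zero.
    """
    N = len(n)
    rho_r = [n[r] * m[r] for r in range(N)]
    rho = sum(rho_r)
    L = [[0j] * (N + 1) for _ in range(N + 1)]
    for r in range(N):
        for s in range(N):
            if r == s:
                L[r][s] = (sum(theta[q][r] for q in range(N) if q != r)
                           + 1j * e[r] * n[r] * H)
            else:
                L[r][s] = -theta[r][s]
        L[r][N] = L[N][r] = rho_r[r]
    # L_e: last row rho_r replaced by n_r e_r.
    Le = [row[:] for row in L]
    for r in range(N):
        Le[N][r] = n[r] * e[r]
    return 1j * rho * det(Le) / (H * det(L))


# Slightly ionized mixture (illustrative numbers): gas 1 neutral, gases 2 and 3 charged.
H = 1.0
n = [1.0e6, 1.0, 1.0]
m = [1.0, 1.0, 1.0e-3]
e = [0.0, 1.0, -1.0]
tau12, tau13 = 0.5, 0.2


def theta_qr(q, r, tau):
    """theta_qr from the collision-interval tau_qr, first form of (36)."""
    rq, rr = n[q] * m[q], n[r] * m[r]
    return rq * rr / ((rq + rr) * tau)


t12, t13 = theta_qr(0, 1, tau12), theta_qr(0, 2, tau13)
theta = [[0.0, t12, t13], [t12, 0.0, 0.0], [t13, 0.0, 0.0]]
sigma_general = conductivity(H, n, m, e, theta)

# Two-term form (46).
omega2, omega3 = e[1] * H / m[1], e[2] * H / m[2]
sigma_46 = (n[1] * e[1] ** 2 * tau12 / (m[1] * (1 + 1j * omega2 * tau12))
            + n[2] * e[2] ** 2 * tau13 / (m[2] * (1 + 1j * omega3 * tau13)))
```

The two values agree to within the small ratio of the ion density to the neutral density, which is the order of the terms neglected in Case I.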

Case II. Electrons present. Consider a mixture in which the molecules of the first 
two gases are different kinds of ions, while those of the third are electrons, the 
number-densities of the three gases being similar in order of magnitude. Only the 
lowest powers of the small quantities $m_3/m_1$, $m_3/m_2$ in our expressions are retained. 

In (36), 

$$(n_q + n_r)D_{qr} \propto \left(\frac{m_q + m_r}{m_qm_r}\right)^{1/2}, \qquad \theta_{qr} \propto n_qn_r\left(\frac{m_qm_r}{m_q + m_r}\right)^{1/2}.$$

Hence $\theta_{12}$ is large compared with $\theta_{13}$ or $\theta_{23}$, but small compared with $\rho_2\theta_{13}/\rho_3$ or 
$\rho_1\theta_{23}/\rho_3$. Thus approximately 

$$\mathcal{L} = H^2e_3n_1n_2n_3(e_2\rho_1m_1 + e_1\rho_2m_2) - iH\rho^2n_3e_3\theta_{12} - \rho^2\theta_{12}(\theta_{13} + \theta_{23})$$
$$\doteq -(\theta_{13} + \theta_{23} + iHe_3n_3)\{\rho^2\theta_{12} + iHn_1n_2(e_2\rho_1m_1 + e_1\rho_2m_2)\},$$

$$\mathcal{L}_e = H^2\rho\,n_1e_1n_2e_2n_3e_3 + iH\rho\,n_3^2e_3^2\theta_{12}$$
$$\doteq iH\rho^{-1}\{n_3^2e_3^2(\rho^2\theta_{12} + iHn_1n_2(e_2\rho_1m_1 + e_1\rho_2m_2)) + n_1^2n_2^2(e_1m_2 - e_2m_1)^2(\theta_{13} + \theta_{23} + iHe_3n_3)\},$$

and so 

$$\sigma^{\mathrm{I}} + i\sigma^{\mathrm{II}} = \frac{n_3^2e_3^2}{\theta_{13} + \theta_{23} + iHn_3e_3} + \frac{n_1^2n_2^2(e_1m_2 - e_2m_1)^2}{\rho^2\theta_{12} + iHn_1n_2(e_2\rho_1m_1 + e_1\rho_2m_2)}. \tag{48}$$


Thus the conductivity can again be divided into two parts, one due to the diffusion 
of electrons relative to ions, and the other to the motion of one set of ions relative 
to the other. The electron conductivity does not depend on the mutual collisions of 
ions; since here approximately $\theta_{13} = \rho_3/\tau_{13}$, $\theta_{23} = \rho_3/\tau_{23}$, it is given by 

$$\sigma^{\mathrm{I}}_e + i\sigma^{\mathrm{II}}_e = \frac{n_3e_3^2}{m_3(\tau_{13}^{-1} + \tau_{23}^{-1}) + iHe_3},$$

which is precisely the expression given by free-path theory. The ion conductivity 
is unaffected by electron collisions; if $\rho\omega_{12} = \rho_1\omega_2 + \rho_2\omega_1$, it is given by 

$$\sigma^{\mathrm{I}}_i + i\sigma^{\mathrm{II}}_i = \frac{n_1n_2(e_1m_2 - e_2m_1)^2\tau_{12}}{\rho\,m_1m_2(1 + i\omega_{12}\tau_{12})},$$

which is precisely the conductivity found for a binary gas from (6). 

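Similarly, when other factors beside the electric field produce diffusion, the total 
electric current is the sum of independent ion and electron contributions. In fact 

$$\mathbf{j} = -\frac{e_3p\{(\tau_{13}^{-1} + \tau_{23}^{-1})\mathbf{d}_3 - H^{-1}\omega_3\mathbf{H}\wedge\mathbf{d}_3\}}{m_3\{(\tau_{13}^{-1} + \tau_{23}^{-1})^2 + \omega_3^2\}} + \frac{(m_1e_2 - m_2e_1)\,p\,\tau_{12}\{(\rho_2\mathbf{d}_1 - \rho_1\mathbf{d}_2) - \omega_{12}\tau_{12}H^{-1}\mathbf{H}\wedge(\rho_2\mathbf{d}_1 - \rho_1\mathbf{d}_2)\}}{m_1m_2\rho(1 + \omega_{12}^2\tau_{12}^2)}, \tag{49}$$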





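where in the present case, by (21), 

$$\mathbf{d}_3 = \frac{1}{p}\left(\frac{\partial p_3}{\partial\mathbf{r}} - n_3e_3\mathbf{E}\right), \tag{50}$$

$$\rho_2\mathbf{d}_1 - \rho_1\mathbf{d}_2 = \frac{\rho_1\rho_2}{p}\left(\frac{1}{\rho_1}\frac{\partial p_1}{\partial\mathbf{r}} - \frac{e_1}{m_1}\mathbf{E} - \frac{1}{\rho_2}\frac{\partial p_2}{\partial\mathbf{r}} + \frac{e_2}{m_2}\mathbf{E}\right). \tag{51}$$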


Equations (48) and (49) cannot always be applied to a mixture in which one gas, 
say the first, has uncharged molecules. The effective collision-radii are much 
greater for pairs of ions than for ions with uncharged molecules, and so in this case 
it is permissible to neglect $\theta_{23}/\theta_{12}$ only when the uncharged molecules are numerous. 

Generally, for a mixture of N gases, when electrons and ions of both signs are 
present, the conductivity and current can be divided into independent electron and 
ion contributions. The electron contribution can be calculated from free-path 
formulae; the collision-interval to be used in these formulae is that given by the first 
approximation of § 6, somewhat increased where necessary, in the way indicated 
in § 4. The accuracy of the first approximation is, however, usually adequate when 
a large number of neutral molecules are present; not only is the effect of electron 
collisions with ions ‘diluted’ by the presence of the neutral molecules, but the 
correction to the effective electron-ion collision-interval is reduced by the collisions 
with the neutral molecules. Ion conductivity cannot in general be expressed by 
free-path formulae (uncorrected) for the reasons noted in § 3. When a large number 
of neutral molecules is present, it can be so expressed; but when an adequate 
number of ions is present, the mutual collision of pairs of ions becomes important, 
and reduces the total conductivity below the sum of the (free-path) contributions 
of the different sets of charged particles present. 


8. Solar applications 

Sunspots possess magnetic fields of order 2000 gauss; the sun is also stated to 
possess a general magnetic field of order 25 gauss. The spot field produces marked 
changes in the conductivity in the surface layers. 

In the sun, the contribution of ions to the conductivity is always smaller than that 
of electrons, and is comparable with it only in extreme cases, namely, when the 
pressure is low and the magnetic field large. Since in any case the conductivity then 
is small, the ion conductivity may in practice be ignored. 

To study the electron conductivity, it is convenient to introduce a mean collision 
distance for molecules of the $r$th and $s$th gases. This is defined as the sum of the radii 
of elastic spherical molecules of the same masses, giving the same collision-function 
$\theta_{rs}$ and collision-interval $\tau_{rs}$. The inverse-square law of interaction of electrons with 
ions gives an expression for $\theta_{rs}$ depending on a slowly varying function $A_1(2)$ of the 
density and temperature (Chapman & Cowling 1939, p. 179). In the sun’s 
atmosphere $A_1(2)$ lies between 10 and 11; we here use the mean value $A_1(2) = 10.6$. Then 
the mean collision-distance for electrons with $Z$-fold ionized atoms is 

$$4.7 \times 10^{-7}\,Z\,(5750/T)\ \text{cm.}$$




As mobility measurements indicate, the collision-distance for electrons with neutral 
molecules is probably not greater than $2 \times 10^{-8}$ cm. Hence, unless neutral atoms 
are, say, 100 times as numerous as ions in the sun’s atmosphere, it is permissible to 
neglect collisions with neutral atoms. In fact, neutral atoms (hydrogen and helium) 
are usually supposed to be only about 30 times as numerous as ionized atoms (metals) 
in the surface layers of the sun’s atmosphere; they disappear completely at a depth 
of a few hundred km. below the visible layers. 



Figure 1. Graphs of $\log_{10}\sigma^{\mathrm{I}}$ (continuous curves) and $\log_{10}\sigma^{\mathrm{II}}$ (broken curves) against 
$\log_{10}T$ for certain values of $H/p_e$ shown against the corresponding curves. 


Neglecting collisions with neutral atoms, a general formula for the (electron) 
conductivity may be given. Suppose several types of ion are present; let the 
number-density of the $r$th type be $n_r$, and let it be $Z_r$ times ionized. The mean degree of 
ionization $\bar{Z}$ is defined by 

$$\bar{Z} = \sum n_rZ_r^2\Big/\sum n_rZ_r. \tag{52}$$

In terms of this 

$$\sigma^{\mathrm{I}} + i\sigma^{\mathrm{II}} = \{6.8 \times 10^{13}\,\bar{Z}\,T^{-3/2} - i\,8.6 \times 10^{3}\,HT/p_e\}^{-1}\ \text{e.m.u.}, \tag{53}$$

where $p_e$ is the electron-pressure in dynes/cm.$^2$. The values of $\sigma^{\mathrm{I}}$ and $\sigma^{\mathrm{II}}$ calculated 
from (53) for surface layers ($\bar{Z} = 1$) are shown graphically in figure 1. 
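
Formula (53) is straightforward to evaluate. The following Python sketch assumes the numerical coefficients $6.8 \times 10^{13}$ and $8.6 \times 10^{3}$ (the second exponent being inferred from the numerical examples of this section) and returns the two conductivities in e.m.u.; the function name is hypothetical.

```python
def solar_conductivity(Z_mean, T, H, p_e):
    """Direct and transverse electron conductivities (e.m.u.) in the form of (53).

    Z_mean : mean degree of ionization defined by (52)
    T      : temperature in degrees K
    H      : magnetic field in gauss
    p_e    : electron-pressure in dynes/cm^2
    """
    resistivity = 6.8e13 * Z_mean * T ** -1.5 - 1j * 8.6e3 * H * T / p_e
    sigma = 1.0 / resistivity
    return sigma.real, sigma.imag


# Surface-layer values quoted in the text, without and with a sunspot-sized field.
sigma_no_field = solar_conductivity(1.0, 5750.0, 0.0, 40.0)
sigma_spot = solar_conductivity(1.0, 5750.0, 2000.0, 40.0)
```

In a field of 2000 gauss the direct conductivity falls to a small fraction of its field-free value, while the transverse conductivity becomes the larger of the two.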

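According to (53), $\sigma^{\mathrm{II}}$ is smaller than $\sigma^{\mathrm{I}}$, and $\sigma^{\mathrm{I}}$ is not very greatly reduced by the 
magnetic field, if 

$$H < \frac{6.8 \times 10^{13}\,\bar{Z}p_e}{8.6 \times 10^{3}\,T^{5/2}} = 7.9 \times 10^{9}\,\frac{\bar{Z}p_e}{T^{5/2}}\ \text{gauss.}$$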





For example, if $\bar{Z} = 1$, $p_e = 40$ dynes/cm.$^2$, $T = 5750$° K (taken as typical of the 
surface layers), this gives $H < 122$ gauss; if $\bar{Z} = 1$, $p_e = 2 \times 10^{4}$ dynes/cm.$^2$, $T = 17,000$° K 
(taken as typical of the layer in which hydrogen ionization is nearly complete, say 
400 km. lower), it gives $H < 4200$ gauss. Hence, if the magnetic field in the interior 
is never much greater than at the surface, its effect on $\sigma^{\mathrm{I}}$ is likely not to be large 
at a depth of more than 1000 km. below the sun’s surface. Even at the surface, the 
effect on $\sigma^{\mathrm{I}}$ is not large save near a sunspot; but near the centre of a spot $\sigma^{\mathrm{I}}$ may be 
only a fraction of 1 % of its value in the absence of the magnetic field. 
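
The limiting field quoted in these examples can be recomputed with a few lines of Python. The coefficient $7.9 \times 10^{9}$ is taken as the ratio of the two coefficients assumed in (53), and the function name is hypothetical; the results agree with the quoted 122 and 4200 gauss to within a few per cent.

```python
def limiting_field(Z_mean, p_e, T):
    """Field (gauss) below which sigma_II < sigma_I: 7.9e9 * Z * p_e / T**2.5."""
    return 7.9e9 * Z_mean * p_e / T ** 2.5


H_surface = limiting_field(1.0, 40.0, 5750.0)      # surface layers
H_deeper = limiting_field(1.0, 2.0e4, 17000.0)     # about 400 km lower
```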

At great depths, and in general when $\sigma^{\mathrm{II}}$ is small compared with $\sigma^{\mathrm{I}}$, approximately 

$$\sigma^{\mathrm{I}} = 1.47 \times 10^{-14}\,T^{3/2}/\bar{Z}\ \text{e.m.u.} \tag{54}$$

The values given by (53) and (54) are first approximations. When $\sigma^{\mathrm{II}}$ is larger than 
$\sigma^{\mathrm{I}}$, the first approximation involves little error. When $\sigma^{\mathrm{II}}/\sigma^{\mathrm{I}}$ is small, the first 
approximation can be roughly corrected by replacing $\bar{Z}$ by $Z'$, where the following 
values of $Z'$ are inferred from the second approximation quoted in § 4: 

    Z     1     1.5   2     2.5   3     4     5     6     8     10    15
    Z'    0.51  0.67  0.83  0.98  1.14  1.44  1.73  2.02  2.60  3.18  4.65

Thus in surface layers, away from a spot, $\sigma^{\mathrm{I}}$ is of order $10^{-8}$ e.m.u.; at a level 500 km. 
lower, where hydrogen ionization is just complete, $\sigma^{\mathrm{I}}$ is of order $10^{-7}$ e.m.u.; at the 
centre of the sun it is of order $6 \times 10^{-4}$ e.m.u.; and half-way along the radius, of 
order $4 \times 10^{-5}$ e.m.u. (taking $Z' = 3$ in the last two estimates). 
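
The corrected form of (54) can be sketched in Python as follows. The tabulated values of $Z'$ are those given above; the linear interpolation between tabulated points is an assumption of this sketch, not part of the original discussion, and the function names are hypothetical.

```python
Z_TABLE = [1, 1.5, 2, 2.5, 3, 4, 5, 6, 8, 10, 15]
Z_PRIME_TABLE = [0.51, 0.67, 0.83, 0.98, 1.14, 1.44, 1.73, 2.02, 2.60, 3.18, 4.65]


def corrected_Z(Z_mean):
    """Z' for a given mean ionization, by linear interpolation in the table (assumed)."""
    if not Z_TABLE[0] <= Z_mean <= Z_TABLE[-1]:
        raise ValueError("Z outside the tabulated range 1 to 15")
    for i in range(len(Z_TABLE) - 1):
        z0, z1 = Z_TABLE[i], Z_TABLE[i + 1]
        if z0 <= Z_mean <= z1:
            t = (Z_mean - z0) / (z1 - z0)
            return Z_PRIME_TABLE[i] + t * (Z_PRIME_TABLE[i + 1] - Z_PRIME_TABLE[i])


def sigma_direct(T, Z_mean):
    """Direct conductivity (e.m.u.) from (54), with Z replaced by Z'."""
    return 1.47e-14 * T ** 1.5 / corrected_Z(Z_mean)


sigma_surface = sigma_direct(5750.0, 1.0)
```

For the surface layers this gives a value of order $10^{-8}$ e.m.u., in line with the estimate in the text.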

In addition to conduction currents due to an applied electric force, currents arise 
in general whenever diffusion occurs. Electrons normally contribute the major part 
of these, as of conduction currents. Their magnitude can be found from the 
conductivity results by regarding the electrons as subject to an equivalent electric 
force $\mathbf{E}'$, where 

$$\mathbf{E}' = \frac{kT}{ep_e}\frac{\partial p_e}{\partial\mathbf{r}},$$

$-e$ being the electronic charge ($-1.59 \times 10^{-20}$ e.m.u.). 


9. Mechanical effects 

The conductivity is, however, not the most important factor determining the 
currents which flow in the sun’s outer layers. Because of the currents induced in 
it, a mass of gas, moving with velocity $\mathbf{c}_0$ across the lines of force, is subject to 
a retarding force $-\sigma^{\mathrm{I}}H^2\mathbf{c}_0$ per unit volume. This destroys the momentum $\rho\mathbf{c}_0$ 
in a time of order $\rho(\sigma^{\mathrm{I}}H^2)^{-1}$. In the first thousand km. below the sun’s surface, 
$\rho/\sigma^{\mathrm{I}} < 10$ g.cm.$^{-3}$/e.m.u.; hence, if $H > 10$ gauss, the time of decay of the motion is 
a fraction of a second. The motion of the gas can persist only if (a) it is maintained 
by an accelerating force (including, as a special case, a pressure gradient), or 
(b) the gas is polarized by its motion in the magnetic field, and an electrostatic 
field is set up, which neutralizes the field $\mathbf{c}_0\wedge\mathbf{H}$; or (c) the interior of a moving 


Electrical conductivity of an ionized gas in a magnetic field 469 

mass of gas is shielded from the external magnetic field by currents induced in its 
surface layers. 

In case (a), the motion maintained is such that the force $j \wedge H$ due to the induced
current just balances the accelerating force. Thus the current is determined by the
latter force; the conductivity merely determines the transverse velocity required
to produce the current. This velocity is normally not large, as solar velocities go.
For example, when $\sigma^{II}/\sigma^{I}$ can be neglected, gravity g, acting across a horizontal
magnetic field, produces a vertical velocity $\rho g/H^2\sigma^{I}$; taking $\rho/\sigma^{I} < 10$ g.cm.$^{-3}$/e.m.u.,
H > 10 gauss, this is found to be less than 30 m./sec., as against velocities of order
1 km./sec. often met near a sunspot. Hence, unless polarization or electromagnetic
shielding occur (cases (b) and (c)), motions in the outer layers of the sun are directed
almost exactly along the lines of magnetic force.
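The two order-of-magnitude estimates of this section, the decay time of order ρ/(σ^I H²) and the gravity-driven velocity ρg/(H²σ^I), can be checked numerically. The following is only a minimal sketch: the function names are illustrative, and the value of solar surface gravity (2.74 × 10⁴ cm./sec.²) is an assumed standard figure that is not quoted in the text.

```python
# Order-of-magnitude check of the magnetic-drag estimates (c.g.s. / e.m.u. units).
# Function names and SOLAR_GRAVITY are illustrative assumptions, not from the paper.

SOLAR_GRAVITY = 2.74e4  # cm/sec^2; assumed standard value for the sun's surface


def decay_time(rho_over_sigma: float, H: float) -> float:
    """Time (sec) of order rho / (sigma_I * H^2) in which the drag destroys the momentum."""
    return rho_over_sigma / H**2


def drift_velocity(rho_over_sigma: float, H: float, g: float = SOLAR_GRAVITY) -> float:
    """Velocity (cm/sec) rho * g / (H^2 * sigma_I) maintained by gravity across the field."""
    return rho_over_sigma * g / H**2


if __name__ == "__main__":
    # Bounds quoted in the text: rho/sigma_I < 10 g.cm^-3/e.m.u., H > 10 gauss.
    print("decay time (sec):", decay_time(10.0, 10.0))
    print("drift velocity (m/sec):", drift_velocity(10.0, 10.0) / 100.0)
```

With the limiting values quoted in the text, both results are upper bounds: a larger field or a smaller ρ/σ^I shortens the decay time and lowers the velocity.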

In cases (b) and (c), the full electric force does not act on the gas. Electric
polarization (case (b)) is able only partially to neutralize the field $c_0 \wedge H$ in the sun’s
atmosphere; the space-charge produced by polarization is dissipated through the
surrounding material at a rate comparable with that at which it is produced. Thus
polarization alone cannot greatly reduce the rate of decay of a motion across the
lines of force. In the motion of a stream of ionized gas outward from the sun (assumed
in certain theories of magnetic storms) the space-charge is dissipated less readily,
because the stream is not surrounded by conducting material.

Electromagnetic shielding (case (c)) is important for a large mass of moving gas.
If a mass of gas, in the interior of which there is no magnetic field, moves in a region
where the field is H, the currents induced in its outer layers imply a retarding
pressure, comparable with the electromagnetic stress $H^2/8\pi$, over its boundary. In unit
time this destroys momentum equal to that of the gas within a distance $H^2(8\pi\rho c_0)^{-1}$
of the boundary. The distance in question is 400 km. if H = 1000 gauss,
$\rho = 10^{-8}$ g./cm.$^3$, $c_0$ = 1 km./sec., and so in this case motion across the lines of
force would persist for only a few seconds even if the dimensions of the moving mass
are of order 1000 km. That is, electromagnetic shielding is relatively unimportant
in the surface layers near a sunspot. At great depths, or in regions where the
magnetic field is small, however, shielding can be effective for relatively small moving
volumes. It is, of course, not necessary to suppose that the magnetic field vanishes
in the transported gas; this may carry its own magnetic field with it in its interior.
Thus, for example, the field of a sunspot might be carried to the surface from a deep
level by the convection of the sunspot material.
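The shielding distance H²/(8πρc₀) quoted above can be evaluated directly. The sketch below (with an illustrative function name) simply reproduces the arithmetic for the values given in the text.

```python
import math


def shielding_distance_cm(H: float, rho: float, c0: float) -> float:
    """Depth (cm) of gas whose momentum the stress H^2/(8*pi) destroys in unit time."""
    return H**2 / (8.0 * math.pi * rho * c0)


if __name__ == "__main__":
    # Values quoted in the text: H = 1000 gauss, rho = 1e-8 g/cm^3, c0 = 1 km/sec.
    d = shielding_distance_cm(1000.0, 1e-8, 1e5)
    print("shielding distance (km):", d / 1e5)
```

Because the distance varies as H²/ρ, it falls rapidly in weak fields or dense gas, which is the sense in which shielding becomes effective for relatively small volumes at great depths.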

To summarize this discussion, motions across the lines of force can persist below 
the sun’s surface because of electromagnetic shielding, and isolated clouds well 
above the surface may be protected from magnetic retardation by polarization. 
But in the surface layers, especially near sunspots, motion must be nearly along the 
lines of magnetic force. This undoubtedly is why prominences often end their 
existence by being sucked into a spot; why, in eruptions near spots, the ejected 
(largely ionized) gas is often seen, on slow-motion cinema records, to travel up and 
return down the same curved path; and why, in such eruptions, the gas shows little
tendency to expand perpendicular to its direction of motion, in spite of thermal
velocities of the atoms of order 10 km./sec. It is probable that the persistent forms 
of quiescent prominences are to be explained similarly. 


10. Application to the earth’s upper atmosphere

The conductivity of the extreme upper atmosphere (above a height of 80 km.,
say) is important in considering the solar and lunar variations of the earth’s magnetic
field. To discuss the conductivity adequately, it is necessary to consider a mixture of
four gases: neutral molecules, positive ions, negative ions, and electrons. These are
distinguished respectively by the suffixes a, +, −, and e. The molecular weights of
neutral molecules and ions are both taken to be 26. If oxygen is largely molecular,
this choice of molecular weights results in an overestimate of the ion conductivity
(the coefficient of mutual diffusion of molecules of masses $m_r$, $m_s$ is proportional to
$(m_r + m_s)^{1/2} m_r^{-1/2} m_s^{-1/2}$); if oxygen is atomic, there is an underestimate; since nitrogen
is probably present in large quantities, the error should not, however, exceed 10 %.

The quantity $A_1(2)$ involved in electrostatic interaction is rather larger than in
solar applications, because of the much smaller number-density of charged particles.
Taking $A_1(2) = 17.5$, it is found that

$$\theta_{+e}/n_+ = \theta_{-e}/n_- = 2.84 \times 10^{-26}\, n_e T^{-3/2}, \qquad \theta_{+-} = 4.37 \times 10^{-24}\, n_+ n_- T^{-3/2}. \qquad (55)$$

This corresponds to a mean collision-distance $1.16 \times 10^{-5}\,(300/T)$ cm. for pairs of
charged particles.

For neutral molecules with ions of either sign, the collision-distance is assumed
to be $5.9 \times 10^{-8}\,(300/T)^{1/4}$ cm. (suggested by the observed mobility of N$_2^+$ ions in N$_2$;
Tyndall 1938). For electrons with neutral molecules, the collision-distance is taken
to be $2 \times 10^{-8}$ cm., as suggested by the observed mobility of electrons in air
(Townsend & Tizard 1913; this work, however, chiefly referred to electrons with velocities
much greater than thermal). Then

$$\theta_{a+}/n_+ = \theta_{a-}/n_- = 2.25 \times 10^{-32}\, n_a, \qquad \theta_{ae} = 9.35 \times 10^{-37}\, n_a n_e T^{1/2}. \qquad (56)$$

The maximum number-densities of electrons in the E, F$_1$ and F$_2$ layers, at noon
at the equator near sunspot maximum, are about $2 \times 10^5$, $3.5 \times 10^5$, and $1.8 \times 10^6$/cm.$^3$.
The ratios $n_-/n_e$, $n_+/n_e$ are denoted by A, 1 + A; it is usually supposed that A is about
100 in the E-layer, much smaller (though probably greater than unity) in F$_1$, and
less than unity in F$_2$. Massey & Bates (1943) have recently suggested that A is less
than unity, even in the E-layer; but they consider only atomic oxygen, and molecular
oxygen is likely to be important in the E-layer.

The number of neutral atoms is normally estimated from the effective
collision-interval $\tau$ of electrons, as inferred from radio reflexion coefficients. Observations in
England suggest a collision-frequency $\tau^{-1}$ about $6 \times 10^4$/sec. at the centre of the
E-layer, $4 \times 10^3$/sec. for F$_1$, and $2 \times 10^3$/sec. for F$_2$; the corresponding
number-densities of electrons are, say, $10^5$, $2.5 \times 10^5$ and $5 \times 10^5$/cm.$^3$. The collision-interval
so found is precisely that effective in diffusion; it is given by

$$\tau^{-1} = (\theta_{ae} + \theta_{+e} + \theta_{-e})/m_e n_e
= 1.8 \times 10^{-8}\, n_a \left(\frac{T}{300}\right)^{1/2} + 6.06 \times 10^{-3}\,(n_+ + n_-)\left(\frac{300}{T}\right)^{3/2}. \qquad (57)$$

In spite of the relative fewness of ions, these must not be neglected in considering
collision-intervals; because of the large collision-distance for ions with electrons,
each ion makes the same contribution to $\tau^{-1}$ as f neutral molecules, where

$$f = \frac{6.06 \times 10^{-3}}{1.8 \times 10^{-8}} \left(\frac{300}{T}\right)^{2} = 3.37 \times 10^{5} \left(\frac{300}{T}\right)^{2}.$$

For example, the values of $\tau^{-1}$ quoted above for E, F$_1$ and F$_2$ can be explained,
neglecting collisions with neutral molecules completely, by assuming the
number-densities of ions in the three layers to be

$$10^{7}\left(\frac{T}{300}\right)^{3/2}, \qquad 6.6 \times 10^{5}\left(\frac{T}{300}\right)^{3/2}, \qquad \text{and} \qquad 3.3 \times 10^{5}\left(\frac{T}{300}\right)^{3/2}\ /\text{cm.}^3$$

These are of the same order as the usually assumed ion-densities.
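The ion densities just quoted follow from the two terms of (57). As a rough check, the sketch below evaluates the collision-frequency with the coefficients 1.8 × 10⁻⁸ and 6.06 × 10⁻³ that appear in the expression for the number of neutral molecules equivalent to one ion; the function names are illustrative and not part of the paper.

```python
# Collision-frequency of electrons after equation (57), c.g.s. units.
# Function and constant names are illustrative assumptions.

COEFF_NEUTRAL = 1.8e-8   # contribution per neutral molecule at 300 deg K
COEFF_ION = 6.06e-3      # contribution per ion (either sign) at 300 deg K


def collision_frequency(n_a: float, n_ions: float, T: float) -> float:
    """tau^-1 (per sec) for n_a neutrals/cm^3 and n_ions = n_+ + n_- ions/cm^3."""
    return (COEFF_NEUTRAL * n_a * (T / 300.0) ** 0.5
            + COEFF_ION * n_ions * (300.0 / T) ** 1.5)


def neutrals_equivalent_to_one_ion(T: float) -> float:
    """Number of neutral molecules giving the same contribution to tau^-1 as one ion."""
    return COEFF_ION / COEFF_NEUTRAL * (300.0 / T) ** 2


def ions_needed(tau_inv: float, T: float) -> float:
    """Ion density (per cm^3) explaining tau^-1 if neutral collisions are neglected."""
    return tau_inv / (COEFF_ION * (300.0 / T) ** 1.5)


if __name__ == "__main__":
    print("neutrals per ion at 300 K:", neutrals_equivalent_to_one_ion(300.0))
    for layer, tau_inv in (("E", 6e4), ("F1", 4e3), ("F2", 2e3)):
        print(layer, "ion density at 300 K:", ions_needed(tau_inv, 300.0))
```

The temperature dependence is carried separately, so the same functions can be evaluated at an electron-temperature other than 300° K.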

However, only a small part of the collisions of electrons is likely to be with ions
in the E-layer. During the day, the logarithm of the reflexion coefficient of wireless
waves traversing the E-layer varies like $\cos^{3/2}\chi$, where $\chi$ is the sun’s zenith angle;
whereas, if collisions with ions were dominant, it would vary like $\cos\chi$. Hence the
number of ions in the E-layer must be much less than $10^7\,(T/300)^{3/2}$/cm.$^3$. This may
come about either because there actually are few ions or because T is very large
(here T is the electron-temperature, not the temperature of the gas as a whole).
Of these alternatives, the first appears the more probable. At any elastic collision
with a molecule, a fast electron loses a fraction of its energy of order $m_e/m_a$, i.e.
$(5 \times 10^4)^{-1}$. It undergoes $6 \times 10^4$ collisions per sec.; thus its energy is reduced in the
ratio 1/e in 5/6 sec. According to Massey & Bates (1943), the average free lifetime of an
electron before attachment to a neutral atom is probably at least of order 30 sec.
Hence, even though when first liberated its energy may be large, an electron is
likely to possess an energy not much greater than the thermal throughout most of
its free existence.

Since the gas temperature in the E-layer is not likely to differ greatly from 300° K,
we conclude that the total number of ions must be much less than $10^7$/cm.$^3$, i.e. much
less than 50 ions of either sign per electron. To explain the observed
collision-frequency, the number-density of neutral molecules must be about $3.5 \times 10^{12}$/cm.$^3$
at the centre of the E-layer. This corresponds to a pressure 1/7 dyne/cm.$^2$, or $10^{-4}$
mm. Hg.

If Massey & Bates are correct in suggesting that there are few negative ions in
F$_1$, only a relatively small fraction of the electron-collisions in this layer can be with
ions. Neglecting ionic collisions completely, and taking T = 900° K, we find that
$n_a = 1.3 \times 10^{11}$/cm.$^3$. Massey & Bates suggest $10^{11}$/cm.$^3$, using the absorption
coefficient of atomic oxygen. For F$_2$, a large proportion of the electronic collisions
may be with ions, and the collision-frequency does not, therefore, give any reliable
information as to the neutral-gas density in this layer. The estimates for E and F$_1$
refer to England; at the equator, where ionizing radiation penetrates deeper, the
densities may be rather greater, say

$n_a = 4.5 \times 10^{12}$/cm.$^3$ for E, and $n_a = 1.6 \times 10^{11}$/cm.$^3$ for F$_1$.
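The neutral-gas densities and the pressure quoted in this section can be recovered from the neutral term of (57) and the perfect-gas law. The following is a minimal sketch under stated assumptions: the function names are illustrative, and the Boltzmann constant and the conversion from dynes/cm.² to mm. Hg are standard values supplied here, not figures from the paper.

```python
# Neutral density from the electron collision-frequency, and the corresponding pressure.
# BOLTZMANN and DYNES_PER_CM2_PER_MM_HG are assumed standard constants.

BOLTZMANN = 1.38e-16              # erg per degree
DYNES_PER_CM2_PER_MM_HG = 1333.2  # 1 mm Hg expressed in dynes/cm^2


def neutral_density(tau_inv: float, T: float) -> float:
    """n_a (per cm^3) if all electron collisions are with neutral molecules, from (57)."""
    return tau_inv / (1.8e-8 * (T / 300.0) ** 0.5)


def pressure_dynes(n: float, T: float) -> float:
    """Perfect-gas pressure n*k*T in dynes/cm^2."""
    return n * BOLTZMANN * T


if __name__ == "__main__":
    print("E-layer n_a (300 K):", neutral_density(6e4, 300.0))
    print("F1-layer n_a (900 K):", neutral_density(4e3, 900.0))
    p = pressure_dynes(3.5e12, 300.0)
    print("E-layer pressure (dynes/cm^2):", p)
    print("E-layer pressure (mm Hg):", p / DYNES_PER_CM2_PER_MM_HG)
```

The E-layer figure obtained this way is slightly below the 3.5 × 10¹²/cm.³ of the text, which is a rounded estimate.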


11. Electron and ion conductivities

The electron conductivity (in e.m.u.) is given by

$$\sigma_e^{I} + i\sigma_e^{II} = n_e^2 e^2/\{\theta_{ae} + \theta_{+e} + \theta_{-e} - iHn_e e\}
= n_e e^2/\{m_e\tau^{-1} - iHe\}
= n_e\{3.57 \times 10^{12}\,\tau^{-1} - iH \cdot 6.29 \times 10^{19}\}^{-1}. \qquad (58)$$

Thus $\sigma_e^{I}$ and $\sigma_e^{II}$ are equal if the collision-frequency $\tau^{-1}$ is $1.76 \times 10^7 H$/sec. This is
at a height of about 85 km., where the total pressure is, say, 100 times that at the
centre of the E-layer; here $n_e$ and the conductivity are not likely to be large. In the
E and F layers, $\sigma_e^{I}$ is small compared with $\sigma_e^{II}$, and approximately

$$\sigma_e^{I} = 9.04 \times 10^{-28}\, n_e H^{-2}\tau^{-1}, \qquad \sigma_e^{II} = 1.59 \times 10^{-20}\, n_e H^{-1}. \qquad (59)$$

The maximum value of $\sigma_e^{I}$ is reached rather below the centre of the E-layer; at
midday at the equator it is of order $1.5 \times 10^{-16}$ e.m.u. In the F-layer, $\sigma_e^{I}$ is probably
never greater than $5 \times 10^{-17}$ e.m.u. The transverse conductivity $\sigma_e^{II}$ is larger; its
maximum values in the E and F layers are about $10^{-14}$ and $10^{-13}$ e.m.u.
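Equation (58) is a single complex expression, so it can be evaluated directly with complex arithmetic and compared with the approximations (59). The sketch below does this; the function names are illustrative, and the field strength H = 0.3 gauss used in the example is an assumed round value for the earth’s field, which the text does not state.

```python
# Electron conductivity after (58): sigma_I + i*sigma_II = n_e / (3.57e12/tau - i*6.29e19*H).
# Function names and the example field H = 0.3 gauss are assumptions for illustration.


def electron_conductivity(n_e: float, tau_inv: float, H: float) -> complex:
    """Complex conductivity (e.m.u.); real part = sigma_I, imaginary part = sigma_II."""
    return n_e / complex(3.57e12 * tau_inv, -6.29e19 * H)


def crossover_frequency(H: float) -> float:
    """Collision-frequency (per sec) at which sigma_I equals sigma_II."""
    return 6.29e19 / 3.57e12 * H


if __name__ == "__main__":
    H = 0.3  # gauss, assumed
    s = electron_conductivity(1e5, 6e4, H)
    print("sigma_I :", s.real, " approx (59):", 9.04e-28 * 1e5 * 6e4 / H**2)
    print("sigma_II:", s.imag, " approx (59):", 1.59e-20 * 1e5 / H)
    print("crossover tau^-1 / H:", crossover_frequency(1.0))
```

Keeping the complex form avoids having to decide in advance whether the collision term or the magnetic term dominates; the approximations (59) emerge whenever the collision-frequency is far below the crossover value.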

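To obtain a general expression for the ion conductivity, the ternary-gas formula
must be used. Suppose that A (= $n_-/n_e$) is fairly large compared with unity;
then the difference between $n_+$ and $n_-$ can be neglected in calculating $\sigma_i^{I}$, but not in
calculating $\sigma_i^{II}$ (which vanishes if $n_+ = n_-$). If we put $n_+ = n_-$,

$$\sigma_i^{I} = \{\tfrac{1}{2}H^2\theta_{a+}^{-1} + (\theta_{+-} + \tfrac{1}{2}\theta_{a+})\,n_+^{-2}e^{-2}\}^{-1}
= \left\{2.22 \times 10^{31}\,\frac{H^2}{n_a n_+} + 3.32 \times 10^{12}\left(\frac{300}{T}\right)^{3/2} + 4.44 \times 10^{7}\,\frac{n_a}{n_+}\right\}^{-1}. \qquad (60)$$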

Numerical values of $\sigma_i^{I}$ when T = 300° K are illustrated in figure 2. When $n_a/n_+$ is
greater than $10^6$, the collisions of ions of opposite signs are unimportant; as a
consequence, the lower curves in figure 2 are nearly identical in form. The flattening
shown by the upper curves is the direct consequence of collisions of pairs of ions
(represented by the second term in the bracket in (60)); because of such collisions,
$\sigma_i^{I}$ can never exceed $3 \times 10^{-13}\,(T/300)^{3/2}$ e.m.u.

When the mutual collisions of ions can be ignored (when $n_a/n_+$ is greater than $10^6$,
or there are few negative ions) the ion conductivity approximates to the sum of the conductivities
due to the positive and negative ions separately, given by the uncorrected free-path
formulae. Hence in this case

$$\sigma_i^{I} + i\sigma_i^{II} = n_+^2 e^2(\theta_{a+} + in_+eH)^{-1} + n_-^2 e^2(\theta_{a-} - in_-eH)^{-1}
= n_+\{8.89 \times 10^7 n_a + i\,6.29 \times 10^{19}H\}^{-1} + n_-\{8.89 \times 10^7 n_a - i\,6.29 \times 10^{19}H\}^{-1}. \qquad (61)$$

This shows that $\sigma_i^{II}$ is numerically less than $\sigma_e^{II}$, and opposite in sign to it. When
$n_a$ is less than $5 \times 10^{11}$/cm.$^3$, $\sigma_i^{II}$ becomes comparable with $\sigma_e^{II}$, and it probably
cancels most of $\sigma_e^{II}$ in the F-layer.
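The free-path formula (61) can be evaluated in the same way as the electron formula. The sketch below is illustrative only: the function name is hypothetical, the field H = 0.3 gauss is an assumed round value, and the ion density 2 × 10⁵/cm.³ in the example is an assumed E-layer figure chosen for the check, not a value taken from this passage.

```python
# Ion conductivity after (61), ignoring mutual collisions of ions (e.m.u.).
# The function name, H = 0.3 gauss and the example densities are assumptions.


def ion_conductivity(n_plus: float, n_minus: float, n_a: float, H: float) -> complex:
    """sigma_I + i*sigma_II for positive and negative ions colliding with neutrals only."""
    collision = 8.89e7 * n_a     # collision term, proportional to neutral density
    magnetic = 6.29e19 * H       # magnetic term
    return (n_plus / complex(collision, magnetic)
            + n_minus / complex(collision, -magnetic))


if __name__ == "__main__":
    H = 0.3  # gauss, assumed
    # Positive ions only, at a neutral density of 4.5e12 per cm^3.
    s = ion_conductivity(2e5, 0.0, 4.5e12, H)
    print("sigma_I :", s.real)
    print("sigma_II:", s.imag)  # negative: opposite in sign to the electron value
    # Equal numbers of positive and negative ions: the transverse part cancels.
    print("sigma_II, n_+ = n_-:", ion_conductivity(2e5, 2e5, 4.5e12, H).imag)
```

The sign of the imaginary part shows directly why the ion contribution to the transverse conductivity tends to cancel the electron contribution.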



Figure 2. Graphs of $\log_{10}\sigma_i^{I}$ against $\log_{10}(n_a/H)$ for certain values of $n_a/n_+$
(shown against the corresponding curves).

Save at the base of the E-layer, $\sigma_i^{I}$ is large compared with $\sigma_e^{I}$. Even if the number
of negative ions is negligible, $\sigma_i^{I}$ is $5 \times 10^{-16}$ e.m.u. at midday at the equator at the
centre of the E-layer, and with A negative ions per electron (A ≪ 50) it is nearly 2A + 1
times as large. Were it not for the increase in dissociation of molecular oxygen with
height, $\sigma_i^{I}$ would take larger values throughout a thick layer above this level.
According to the Chapman theory of layer formation, assuming no negative ions,
$n_e$ is roughly half its maximum value at a height at which $n_a$ has decreased in the
ratio 1/10; this makes $\sigma_i^{I} = 2 \times 10^{-15}$ e.m.u. at a level close to the base of the F$_1$-layer.
However, the E-layer arises from the ionisation of molecular oxygen; because of
increasing dissociation of oxygen the conductivity of the E-layer is limited to
a thickness which, for few negative ions, may be estimated as, say, 50 km. If a
large number of negative ions of molecular oxygen are present, the thickness is likely
to be smaller. For different values of A, the integrated conductivity of the layer,

$$\int \sigma_i^{I}\,dh \quad (h = \text{height}),$$

is estimated as shown:

A                          60           20             10           5              2            0
∫σ_i^I dh (e.m.u.)         3 × 10^-7    1.4 × 10^-7    8 × 10^-8    4.5 × 10^-8    2 × 10^-8    5 × 10^-9


These estimates are, of course, very rough; they provide little more than an idea 
of the order of magnitude. 

In the F$_1$-layer, the ion conductivity is appreciably affected by the magnetic field.
Assuming no negative ions, $\sigma_i^{I}$ is $8 \times 10^{-15}$ e.m.u. at the centre of the layer, and
similar in order of magnitude throughout much of the lower part of the layer; if
there are an appreciable number of negative ions, $\sigma_i^{I}$ is multiplied by 1 + 2A. At the
centre of F$_2$, assuming $n_a = 4 \times 10^{10}$/cm.$^3$, it is found that $\sigma_i^{I} = 1.6 \times 10^{-14}$ e.m.u.
Thus the integrated conductivities of the two layers, assuming widths 50 and 60 km.
respectively, are $4 \times 10^{-8}$ (at least) and $10^{-7}$ e.m.u.

It would appear from this that ‘dynamo’ currents due to tidal motion in the
earth’s atmosphere are not unlikely to flow mainly in the F-layer. This conclusion
is, however, questionable, because of the drag due to induction. A motion across the
lines of force, once started, tends to decay in a time $\rho(\sigma^{I}H^2)^{-1}$ (cf. § 9); this is
[illegible]/(2A + 1) days at the centre of the E-layer, 3⅓/(2A + 1) hours at the centre of F$_1$,
and only 20 min. at the centre of F$_2$.

The tidal motion may be compared with the forced harmonic motion given by the
equation

$$\ddot{x} + \mu\dot{x} + \sigma^2 x = Ae^{i\sigma' t},$$

the solution of which is $x = Ae^{i\sigma' t}/(\sigma^2 - \sigma'^2 + i\mu\sigma')$.

Here $\mu^{-1}$ is the above time of decay; the periods $2\pi/\sigma$, $2\pi/\sigma'$ of the free and forced
oscillations are not very different (Pekeris 1937). Assuming that $\mu$ and $\sigma$ are
comparable and $\sigma - \sigma'$ is small, the phase of the forced oscillation differs from that of the
undamped oscillation by a quarter of a period, and its amplitude is reduced, by the
inclusion of the resistance term $\mu\dot{x}$, roughly in the ratio

$$(\sigma^2 - \sigma'^2)/\mu\sigma' = 2(\sigma - \sigma')/\mu.$$

Taking $2(\sigma - \sigma')$ = [illegible] (to the correct order of
magnitude), the tidal motions in F$_1$ are reduced in the ratio 1/3(2A + 1), and
those in F$_2$ in the ratio 1/30. The effect on the magnitude of the
induced currents is roughly the same as if the tidal motion were unaltered, but the
integral conductivities of the two layers were reduced to $1.3 \times 10^{-8}$ and $3 \times 10^{-9}$
e.m.u.; the phase of the currents is altered roughly by a quarter of a period.
By the same argument, an increase of A above 50 in the E-layer, if such were
possible, would not produce a great change in the magnitude of currents induced
in this layer.
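The comparison with forced harmonic motion can be illustrated numerically. In the sketch below the function names are illustrative and the free-oscillation frequency is a purely hypothetical value, chosen only so that the free and forced periods are "not very different"; the resistance coefficient is the reciprocal of the 20 min. decay time quoted for F₂.

```python
import cmath
import math

# Forced, damped harmonic motion: x'' + mu*x' + sigma^2 x = A exp(i*sigma_p*t).
# Function names and the value of sigma are illustrative assumptions.


def forced_response(sigma: float, sigma_p: float, mu: float, A: float = 1.0) -> complex:
    """Complex amplitude of the steady forced oscillation."""
    return A / complex(sigma**2 - sigma_p**2, mu * sigma_p)


def amplitude_ratio(sigma: float, sigma_p: float, mu: float) -> float:
    """Amplitude with resistance divided by amplitude without it (sigma != sigma_p)."""
    return abs(forced_response(sigma, sigma_p, mu)) / abs(forced_response(sigma, sigma_p, 0.0))


def phase_lag_degrees(sigma: float, sigma_p: float, mu: float) -> float:
    """Lag of the damped response behind the undamped one, in degrees."""
    return -math.degrees(cmath.phase(forced_response(sigma, sigma_p, mu)))


if __name__ == "__main__":
    sigma_p = 2.0 * math.pi / (12.0 * 3600.0)  # semi-diurnal forcing, per sec
    sigma = 1.6e-4                             # hypothetical free frequency, per sec
    mu = 1.0 / 1200.0                          # reciprocal of a 20 min decay time
    print("exact ratio      :", amplitude_ratio(sigma, sigma_p, mu))
    print("(s^2-s'^2)/(mu s'):", (sigma**2 - sigma_p**2) / (mu * sigma_p))
    print("2(s-s')/mu       :", 2.0 * (sigma - sigma_p) / mu)
    print("phase lag (deg)  :", phase_lag_degrees(sigma, sigma_p, mu))
```

When the resistance term dominates, the exact ratio and the two approximate forms agree closely and the lag approaches a quarter of a period (90°), as the argument in the text requires.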

The arguments set out in § 10 suggest that A cannot exceed (say) 25 in the E-layer.
Thus the currents flowing in the E and F layers together are the same in amplitude
as if induction drag were negligible, and the integral conductivity of the two layers
taken together were between $2 \times 10^{-7}$ and $2 \times 10^{-8}$ e.m.u. The major part of this
effective conductivity is contributed by the E-layer, unless A < 2 in this layer.
The currents are in phase with the tidal oscillation, or a quarter period out of phase,
according as they flow chiefly in the E or F layer.

The effective conductivity thus obtained is smaller than earlier estimates; e.g.
Chapman & Bartels (1940) suggested an integral conductivity of order $10^{-5}$ e.m.u.
The chief reasons for the difference are the smaller width through which the E-layer
is supposed to be conducting, and the smaller assumed value for A in the E-layer.
As seen above, even if A could be much larger than assumed here, the electric currents
in the E-layer would not be much larger, because of the increased effect on the
conductivity of collisions between ions of opposite signs, and because of induction drag,
neither of which has been considered earlier.

Chapman found that, to explain the lunar variation of the earth’s magnetic field,
the integral conductivity of the atmosphere at noon at the equator near sunspot
maximum must be about $2.5 \times 10^{-5}$ e.m.u. if the speed of the tidal currents is the
same in the upper atmosphere as at the ground (Chapman 1919); if it is l times as
great, the integral conductivity is $2.5 \times 10^{-5}\,l^{-1}$ e.m.u. The difference between the
conductivities parallel and transverse to the magnetic field was not realized at the
time of Chapman’s paper; since the resistance of a circuit is determined chiefly by
the conductivity of its more resisting parts, the integral conductivity transverse to
the field is probably of order $2 \times 10^{-5}\,l^{-1}$ e.m.u. Hence the estimates given above
suggest that l is between 100 and 1000. This is not unreasonable, in view of the size
of the tidal oscillation observed in the E-layer (Appleton & Weekes 1939).

The dynamo theory of the lunar variations in the earth’s magnetic field is subject
to the difficulty that the observed lunar tide in the E-layer appears to be opposite
in phase to that required to explain the magnetic variations. The present work does
not resolve this difficulty; but it may help to explain certain other minor
phase-difficulties. Since the currents flowing in the E and F layers differ in phase, differences
in phase between the solar and lunar variations in the earth’s field may arise because
of the different importance of the E and F layers in the two types of variation. Again,
currents in the F-layer are chiefly determined by the balance between induction
drag and tidal forces, and are nearly independent of the degree of ionization, while
currents in the E-layer are roughly proportional to the ion-density; hence seasonal
changes in phase are likely if the contributions of the two layers are comparable in
importance.



12. Diamagnetism and drift currents

Beside the dynamo theory of solar and lunar variations in the earth’s field, other 
theories have also been examined, ascribing the variations either to the diamagnetic 
properties of ions and electrons, or to drift currents due to the combined action of 
gravitational, electric and magnetic fields. In another connexion (Cowling 1932) 
I have shown that the effects of diamagnetism and drift currents can also be described 
in terms of the electric currents arising from diffusion. These effects are accordingly 
discussed briefly here, even though Chapman & Bartels (1940) have set out weighty 
reasons why they are inadequate to explain the observed variations. 

If A is less than 25 in the E-layer, and small in the F-layer, the diffusion velocities
of electrons, positive ions and negative ions are roughly independent. Using a formula
like (47), and substituting for $d_r$ from (21), the total electric current is found to be

$$j = \sum_r (\theta_{ar}^2 + n_r^2 e_r^2 H^2)^{-1}\Big\{ n_r e_r\theta_{ar}\Big(-\frac{\partial p_r}{\partial r} + \frac{\rho_r}{\rho}\frac{\partial p}{\partial r} + n_r e_r E\Big)
- n_r^2 e_r^2\, H \wedge \Big(-\frac{\partial p_r}{\partial r} + \frac{\rho_r}{\rho}\frac{\partial p}{\partial r} + n_r e_r E\Big)\Big\}, \qquad (62)$$

where the summation is over positive ions, negative ions, and electrons. In this, the
terms involving $H \wedge \partial p_r/\partial r$ represent diamagnetism and a drift current due to
inhomogeneity in the magnetic field, and the terms involving $H \wedge \partial p/\partial r$ (which is
approximately $H \wedge \rho g$) and $H \wedge E$ represent the drift currents due to gravity and the
electric field. It is necessary to retain the E-terms; the relative diffusion of positive
and negative charges produces a space-charge, whose field tends to oppose any
further separation of positive and negative charges. Because of the horizontal
stratification of the ionosphere, E will be vertical, as well as $\partial p_r/\partial r$ and $\partial p/\partial r$.

Here it must be recalled that (62) applies strictly only when the factors causing 
diffusion are transverse to the magnetic field, i.e. when H is horizontal. If H is not 
horizontal, (62) gives correctly only the component of j transverse to H . Electron 
diffusion along the lines of force is unhampered by the magnetic field, and is much 
faster than ion diffusion. Thus, when H is not nearly horizontal, the electric field is 
due to electrons outstripping ions, and tends to reduce their diffusion velocity to 
equality with that of ions. That is, approximately, 


$$n_e eE = -\frac{\partial p_e}{\partial r} + \frac{\rho_e}{\rho}\frac{\partial p}{\partial r},$$

save in a narrow belt, one or two degrees of latitude in breadth, about the magnetic
equator. This means that the electronic drift-current due to E nearly neutralizes
that due to gravity and the currents due to electron diamagnetism, save in this belt.
In this belt, however, the vertical diffusion of ions is faster than that of electrons;
the electric field operates so as to increase the diffusion of electrons, and to amplify
the electron diamagnetic and drift currents.

In the F-layer, the precise value of the electric force is unimportant. Since each
$\theta_{ar}$ is small compared with $n_r e_r H$ here, the diamagnetic and drift currents are
approximately

$$-\sum_r H^{-2}\, H \wedge \Big(-\frac{\partial p_r}{\partial r} + \frac{\rho_r}{\rho}\frac{\partial p}{\partial r} + n_r e_r E\Big)
= H^{-2}\, H \wedge \Big(\frac{\partial}{\partial r}(p_+ + p_- + p_e) - \frac{\rho_+ + \rho_- + \rho_e}{\rho}\frac{\partial p}{\partial r}\Big)
= H^{-2}\, H \wedge \Big(2\frac{\partial p_e}{\partial r} - \frac{p_e}{p}\frac{\partial p}{\partial r}\Big), \qquad (63)$$

since $p_-$, $\rho_-$ and $\rho_e$ are small, $p_+ = p_e$ and $\rho_+/\rho = p_+/p$. To get an idea as to the order
of magnitude of the total current, suppose that the variation of $p_e$ with p (not
necessarily with the height) is that for a simple Chapman layer. Then

$$p_e = P_e\,(p/P)^{1/2}\,e^{\frac{1}{2}(1 - p/P)},$$

where $P_e$ is the value of $p_e$ at the level of maximum electron-density, and P is the
corresponding value of p. Thus the currents (63) are

$$-\frac{H \wedge}{H^2}\,\frac{P_e}{P}\Big(\frac{p}{P}\Big)^{1/2} e^{\frac{1}{2}(1 - p/P)}\,\frac{\partial p}{\partial r}.$$

Integrate with respect to height, neglecting the variation of H with height. Then if
$\vartheta$ is the magnetic dip, the resulting integral current is of magnitude

$$H^{-1}\sin\vartheta\,\frac{P_e}{P}\int_0^\infty \Big(\frac{p}{P}\Big)^{1/2} e^{\frac{1}{2}(1 - p/P)}\,dp = \sqrt{(2\pi e)}\,H^{-1}P_e\sin\vartheta = 4.13\,H^{-1}P_e\sin\vartheta.$$

If the currents are supposed to flow in a thin current-sheet, the field they produce
just below this sheet is

$$2\pi \times 4.13\,H^{-1}P_e\sin\vartheta = 26\,H^{-1}P_e\sin\vartheta.$$

Taking $P_e = 2.5 \times 10^{-7}$ dynes/cm.$^2$ (corresponding to $1.8 \times 10^6$ electrons/cm.$^3$, and
a temperature 1000° K), this gives a field at noon near the equator at sunspot
maximum of order $2 \times 10^{-5}$ gauss = 2γ. To the order to which we have worked,
the term of (63) involving $\partial p_e/\partial r$ (the diamagnetic term) does not contribute to this
field; the integral diamagnetic current is zero, and diamagnetism affects the field
only because forward and reverse currents flow at different heights. The effect of
diamagnetism on the field is therefore much smaller than 2γ.
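The numerical factor 4.13 and the resulting field can be checked by evaluating the Chapman-layer integral, whose closed form is √(2πe) with e the base of natural logarithms. The following is a minimal sketch; the function names are illustrative, the Boltzmann constant is an assumed standard value, and the field strength H = 0.3 gauss is an assumed round figure not stated in the text.

```python
import math

# Integral of x^(1/2) * exp((1 - x)/2) over x = p/P from 0 to infinity, and the
# field just below a thin current-sheet. Names, BOLTZMANN and H are assumptions.

BOLTZMANN = 1.38e-16  # erg per degree, assumed standard value


def chapman_integral_closed() -> float:
    """Closed form sqrt(2*pi*e) of the integral."""
    return math.sqrt(2.0 * math.pi * math.e)


def chapman_integral_numeric(upper: float = 60.0, steps: int = 200_000) -> float:
    """Midpoint-rule estimate of the same integral, truncated at x = upper."""
    h = upper / steps
    total = 0.0
    for k in range(steps):
        x = (k + 0.5) * h
        total += math.sqrt(x) * math.exp(0.5 * (1.0 - x))
    return total * h


def sheet_field(P_e: float, H: float, sin_dip: float = 1.0) -> float:
    """Field (gauss) just below the current-sheet: 2*pi * sqrt(2*pi*e) * P_e * sin(dip) / H."""
    return 2.0 * math.pi * chapman_integral_closed() * P_e * sin_dip / H


if __name__ == "__main__":
    print("closed form:", chapman_integral_closed())
    print("numerical  :", chapman_integral_numeric())
    P_e = 1.8e6 * BOLTZMANN * 1000.0  # electron pressure, dynes/cm^2
    print("P_e        :", P_e)
    print("field (gauss), H = 0.3:", sheet_field(P_e, 0.3))
```

The agreement of the numerical and closed-form values confirms the factor, and the field comes out at the order of 2 × 10⁻⁵ gauss for any field strength near the assumed value.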

In the E-layer, $n_+$ and $n_-$ are probably at most of the same order as $n_+$ in F. In
the E-layer, however, $\theta_{a+}/n_+$ and $\theta_{a-}/n_-$ are large compared with eH; thus, in the
absence of an electric force, ion currents are likely to be much smaller in E than
in F. The same is true also of electronic currents, since $n_e$ is much smaller in E than
in F. The presence of an electric force does not greatly affect the situation. Except
near the equator, the electric force increases the positive-ion currents, but decreases
the negative-ion and electron currents by a rather larger amount. Near the equator
the reverse is the case; if there are no negative ions, the electric field effectively
transfers the forces causing diffusion from positive ions to electrons, and thereby
increases the currents, since the drift of electrons is much less impeded by collisions
than that of ions. If negative ions are present, the effect is much smaller, and in
any case the currents in E cannot be more than comparable with those in F.

It is concluded therefore that diamagnetism and drift currents cannot produce
a total field much greater than 2γ. The resulting diurnal variation of the field will be
of order 2γ; semidiurnal changes must be much smaller. The inadequacy of the
theory to explain the observed variations is therefore clear.

Appendix 

Boltzmann’s equation and ionized gases. At first sight it would appear that the
Boltzmann equation does not apply to an ionized gas. This equation assumes that
the motion of a molecule can be divided into relatively brief instants of encounter,
separated by relatively long intervals during which forces of interaction with other
molecules can be neglected in comparison with field forces. But in an ionized gas,
because of the slow attenuation of electrostatic forces with increasing distance,
interaction forces are everywhere at least comparable with field forces. For example,
consider conditions in the reversing layer of the sun. The interaction force between
two electrons r cm. apart is $2.25 \times 10^{-19}/r^2$ dynes; the force on an electron moving
with speed $5 \times 10^7$ cm./sec. (the mean thermal speed at 5750° K) in a magnetic field
1000 gauss is $8 \times 10^{-10}$ dyne: these are equal only if $r = 1.7 \times 10^{-5}$ cm., which is not
much less than the mean distance between neighbouring charged particles. Most
field forces are, moreover, very small compared with the above magnetic forces.

The position is saved by the fact that electrostatic forces, though large compared
with field forces, are nevertheless not strong enough to produce large effects save at
distances small compared with the mean distance between neighbouring charges.
The mutual electrostatic energy of our two electrons is $2.25 \times 10^{-19}/r$ erg; this equals
the mean kinetic energy 2kT of relative motion only if $r = 1.4 \times 10^{-7}$ cm. Thus, for
every electrostatic encounter producing a large deflexion, there are many producing
small deflexions; the cumulative effect of the latter outweighs that of the former.
But, in an encounter producing a small deflexion, the effects of encounter and field
forces are roughly additive; that is, the change in velocity can be represented by
supposing that the molecules normally move under field forces alone, but that at a
given instant their velocities are suddenly changed by the full effect of the
interaction forces. The motion then becomes of the type to which Boltzmann’s equation
can be applied.
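The two characteristic distances used in this argument follow from simple balances and can be checked as below. This is a sketch only: the function names are illustrative, the Boltzmann constant is an assumed standard value, and the squared charge 2.25 × 10⁻¹⁹ (e.s.u.) and the charge 1.59 × 10⁻²⁰ e.m.u. are the figures used in the paper.

```python
import math

# Distances at which (i) the electrostatic force between two electrons equals the
# magnetic force on one of them, and (ii) their mutual energy equals 2kT.
# Function names and BOLTZMANN are assumptions; charges are the paper's figures.

E_SQUARED_ESU = 2.25e-19  # e^2 in e.s.u., so that force = e^2/r^2 dynes
E_EMU = 1.59e-20          # electronic charge in e.m.u.
BOLTZMANN = 1.38e-16      # erg per degree, assumed standard value


def equal_force_distance(speed: float, H: float) -> float:
    """r (cm) at which e^2/r^2 equals the magnetic force e*v*H."""
    return math.sqrt(E_SQUARED_ESU / (E_EMU * speed * H))


def equal_energy_distance(T: float) -> float:
    """r (cm) at which e^2/r equals the mean kinetic energy 2kT of relative motion."""
    return E_SQUARED_ESU / (2.0 * BOLTZMANN * T)


if __name__ == "__main__":
    print("magnetic force (dyne):", E_EMU * 5e7 * 1000.0)
    print("equal-force distance (cm):", equal_force_distance(5e7, 1000.0))
    print("equal-energy distance (cm):", equal_energy_distance(5750.0))
```

The ratio of the two distances, roughly a hundred, is what allows many small deflexions for every large one.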

It may appear surprising that, even though magnetic forces on a charge are not
large compared with interaction forces, they dominate the interaction forces in a
sunspot to the extent that conditions of ‘free spiralling’ are approached. The reason
again is that a single encounter rarely produces a sizeable deflexion, and successive
encounters produce deflexions in different directions. Thus the magnetic field, acting
consistently in the same sense, is able to build up a large deflexion sooner than the
random efforts of encounters.




In conclusion, I should like to thank Professor S. Chapman for his continued 
interest and helpful advice during the progress of this work. 


References 

Appleton, E. V. & Weekes, K. 1939 Proc. Roy. Soc. A, 171, 171. 

Chapman, S. 1919 Phil. Trans. A, 218, 1. 

Chapman, S. & Bartels, J. 1940 Geomagnetism. Oxford Univ. Press.

Chapman, S. & Cowling, T. G. 1939 The mathematical theory of non-uniform gases. Camb. 
Univ. Press. 

Cowling, T. G. 1932 Mon. Not. R. Astr. Soc. 92, 407. 

Massey, H. S. W. & Bates, D. R. 1943 Rep. Prog. Phys. Physical Soc. Lond. 9, 62.

Pekeris, C. L. 1937 Proc. Roy. Soc. A, 158, 650. 

Townsend, J. S. & Tizard, H. T. 1913 Proc. Roy. Soc. A, 88, 336.

Tyndall, A. M. 1938 Positive Ions. Camb. Univ. Press. 




INDEX TO VOLUME 183 (A) 

Absorption of radium (B+C) gamma-rays (Roberts), 338. 

Acetone, thermal decomposition (Smith & Hinshelwood), 33. 

Allen, D. N. de G. & Southwell, R. V. Relaxation methods applied to engineering 
problems, X, 125. 

Allen, D. N. de G., Southwell, R. V. & Vaisey, Gillian, Relaxation methods applied to 
engineering problems, XI, 258. 

Anniversary address (Dale), 243. 

Balmer spectrum, long duration in excited hydrogen (Rayleigh), 26.

Bell, R. P. The occurrence and properties of molecular vibrations with V(x) = ax⁴, 328.

Bell, R. P. & Longuet-Higgins, H. C. The normal vibrations of bridged X₂Y₆ molecules, 357.

Bhabha, H. J. & Harish-Chandra. On the theory of point-particles, 134. 

Bruce, C. E. R. The initiation of long electrical discharges, 228. 

Chadwick, J., May, A. N., Pickavance, T. G. & Powell, C. F. An investigation of the 
scattering of high-energy particles from the cyclotron by the photographic method, I. 
The experimental method, 1. 

Champion, F. C. & Powell, C. F. Application of the photographic method to problems
in nuclear physics, II. The scattering of 8.8 and 13 MeV neutrons by protons, 64.

Chang, T. S. A note on the Hamiltonian theory of quantization, 316.

Continuous spectra in flames, role of atomic oxygen (Gaydon), 111.

Cosmic-ray bursts, production by mesotrons (Mohr & Stafford), 54.

Cowling, T. G. The electrical conductivity of an ionized gas in a magnetic field, with
applications to the solar atmosphere and the ionosphere, 453.

Cyclotron, scattering of high-energy particles (Chadwick & others), 1.

Dale, Sir Henry. Anniversary address, 243.

Daniels, H. E. The statistical theory of the strength of bundles of threads, I, 405.

Dirac, P. A. M. Unitary representation of the Lorentz group, 284.

Dissociation energy of CO, N₂, NO and CN (Gaydon & Penney), 374.

Electrical conductivity of an ionized gas in a magnetic field (Cowling), 453.

Electromagnetic fields, electron swarms (Gabor), 436.

Equation of state for solids (Fürth), 87.

Fibrous proteins, combination with acids (Gilbert), 167. 

Flow of gas through a nozzle with velocities exceeding the speed of sound (Fox Sc South- 
well), 38. 

Fox, L., Sc Southwell, R. V. On the flow of gas through a nozzle with velocities exceeding 
the speed of sound, 38. 

Furth, R. On the equation of state for solids, 87. 

Gabor, D. Stationary electron swarms in electromagnetic fields, 436. 

Gaydon, A. G. Continuous spectra in flames: the role of atomic oxygen in combustion, 111.




Gaydon, A. G. & Penney, W. G. The dissociation energies of CO, N₂, NO and CN, 374.
Gilbert, G. A. The combination of fibrous proteins with acids, II, 167. 

Guggenheim, E. A. Statistical thermodynamics of mixtures with zero energies of 
mixing, 203. 

Guggenheim, E. A. Statistical thermodynamics of mixtures with non-zero energies of 
mixing, 213. 

Hamiltonian theory of quantization (Chang), 316. 

Harish-Chandra. On the removal of the infinite self-energies of point-particles, 142. 
Harish-Chandra. See also Bhabha & Harish-Chandra. 

Hinshelwood, C. N. See Smith & Hinshelwood. 

Jánossy, L. Some aspects of the production of mesons and the barometer effect of
penetrating showers, 190. 

Jánossy, L. & Rochester, G. D. The barometer effect of penetrating showers, 186.
Jánossy, L. & Rochester, G. D. The transition effect of penetrating showers, 181.
Jeffrey, G. A. The structure of polyisoprenes, I, 388. 

Long electrical discharges, initiation (Bruce), 228. 

Longuet-Higgins, H. C. See Bell & Longuet-Higgins. 

Lorentz group, unitary representation (Dirac), 284. 

Magnetic field, electrical conductivity of ionized gas (Cowling), 453. 

May, A. N. See Chadwick & others. 

Mohr, C. B. O. & Stafford, G. H. The production of cosmic-ray bursts by mesotrons, 54. 
Molecular vibrations with V(x) = ax⁴ (Bell), 328.

Normal vibrations of bridged X₂Y₆ molecules (Bell & Longuet-Higgins), 357.

Penetrating showers, barometer effect (Jánossy), 190.

Penetrating showers, barometer effect (Jánossy & Rochester), 186.

Penetrating showers, transition effect (Jánossy & Rochester), 181.

Penney, W. G. See Gaydon & Penney. 

Photographic method, applications to nuclear physics, II (Champion & Powell), 64. 
Pickavance, T. G. See Chadwick & others.

Point-particles, removal of infinite self-energies (Harish-Chandra), 142. 

Point-particles, theory (Bhabha & Harish-Chandra), 134. 

Polyisoprenes, structure (Jeffrey), 388. 

Powell, C. F. See Chadwick & others. 

Powell, C. F. See Champion & Powell. 

Rayleigh disk, performance (Scott), 296.

Rayleigh, Lord. Long duration of the Balmer spectrum in excited hydrogen, 26.
Relaxation methods applied to engineering problems, X (Allen & Southwell), 125.
Relaxation methods applied to engineering problems, XI (Allen, Southwell & Vaisey), 258.

Roberts, J. E. Absorption of radium (B+C) gamma-rays, 338.

Rochester, G. D. See Jánossy & Rochester.

Scott, R. A. An investigation of the performance of the Rayleigh disk, 296.

Smith, J. R. E. & Hinshelwood, C. N. The thermal decomposition of acetone, 33.
Southwell, R. V. See Allen & Southwell.





Southwell, R. V. See also Allen, Southwell & Vaisey. 

Southwell, R. V. See Fox & Southwell. 

Stafford, G. H. See Mohr & Stafford. 

Stationary electron swarms in electromagnetic fields (Gabor), 436. 

Statistical theory of the strength of bundles of threads, I (Daniels), 405. 

Statistical thermodynamics of mixtures with non-zero energies of mixing (Guggenheim), 
213. 

Statistical thermodynamics of mixtures with zero energies of mixing (Guggenheim), 203. 
Thermal decomposition of acetone (Smith & Hinshelwood), 33. 

Vaisey, Gillian. See Allen, Southwell & Vaisey. 


END OF THE ONE HUNDRED AND EIGHTY-THIRD VOLUME (SERIES A)











